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IJ3)VELTY - Generatirig a Prequency distribution (FD) of scores (S), related 
n.-> random idontif loat ions of biol^jgical mc le^-ules C), for a particular 
eMporiniental condL*:iori (EC), is new. 

DETAILECi LiESCRI PTI'PII - Generaimg a irequenoy ^Ij.ot r ibution [FD) of 
scores (S), related to i-and-.m i denr i f loap : 'r.-ns of bijlogical molecules (I), 
for a particular exp>erimental condition (EC), is new. The method 
mp rises : 

(a) generating inass data at EC, f-i-r Pn<jwn (I) in a database, and for 
a hypothe t iCdl (Ij; 

ip)) comparing the data to calculate P tna:: is a function of 
the similar Lty of the data; 

(c) selecting S tnat r'ji>resent a higr: riegree oP similarity; 

(i-l) repea^-ing the meth>.M-i for other nyp'jt net i :a 1 (I); and 

(e) determining the fre^qaen.--/ of -el- in g eacn S to generate FD. 

IIIDEPEUPEIIT CLAIMS are alS'.:) incPi-u:ied ror the frjiPowing: 

{1} the n<jvel :iietn-.'d In which the hypi'the t i ::al ;r) in the first step 
is replaced oy a (1} random Fy selected from the aatal;)ase; 

(3) the n'jvel metht-d ftr identifying, at a tiarticular significance 
level, an u n P.n o wn ( F } ; 

{'i) determining statistical signLficance of an ilentified score for 

{ 1) ; 

(4) FD prepared oy the novel metnod; 

(5) identifying a -I) at a particular EC and significance level, 
2 .)m.pri3 in g : 

(a) selecting a significance level that represents a level of 
confidence in a biological m.ole::ule identification; 
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fb) -leaving the unknown bi:..loqicai mviecule into c.aiotituent 
I-) iieneratm^ ma^s data 1-r ^.nese c-.stituent p.r^.:,; 

'omparing tne mass dat . w:.th mass data genersit.^d l or tne 
known bioL-'gioai moleouies of a dai.ab/.so; 

-..loulating 3-;;ores for ^.aoh oomparison, whica ar.e a 
function -A similarity betw^.on n.ss d.ta or the unknown and known 

'''^'''''fJj^:>-eoting a sooro generat-d :n (e) whioh oorresponds to a 
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OMmparison wnich denotes a nighi degre* 
c.rresponds to a particular known nol- 
(ci) o-'mt:aring the scc-re selected 
distriL.utLjn relates t :: r^. n i :)m r-iol sg 
has a cri":c=:I score associuted 'w^tn 
s..'-jnif ican-;e level; an J 



■ if ;>imilarity, 
■sole; 

in [t) with F'J 
. t, vshioh corre.:i; 



and v^h i rh 

jf scores for EC, the 
^ n*: i f :-CLit ic^ns , and 
jnos t'S L he 



(h) 
utioal 
(0) 
c (I), 
(a) 

con fide no 



^determining 



the 



; IS equ:i 
qni f icano 



t 



fo: 



deterniining the statistical 
comprising: 

selectina a sigiii f i canoe lev-^ tnat repre:: 
^'in a biological molecule identification; 
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(b) calculating a sos-re asso-olat 
m-lecule, the score Is a function of 
data of tr.e unknown ofid known n>slecui 
:■-) 'rojmparinq the S'::jre with a s 
generated by romparing mass data of h 
mass data generated for known molecules, ana t 
level assc''tiated witfi it; arid 

(a) det- tuain^ijq if tne so 
identification is within th.e s 
(7) a computer {_-iogran. sy 
USE - The methoo is used 



a with an unKri'jwn t-iol' 
Lmilarlty t-etween r:ass 
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ue FD, t-he jos^rilruti 
;;)tneticai b 1 i c^g i Cd ... it'jief.siits wj 
FD nas a s i :;ni f icance 

o associated wi^h tne un-mowr: mDlecule 
n -1 f i c .1 n e 1 e v e 1 ; s r, i 
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ir;ieri^ify unkn-jwn 
proteins, nurleic acids ana pel yscccnarid^rs , f r ; m ^ss 
spectroscopy data and to assess the reliability or tne 
C'wg. 0/14 
CEI EE'I 
Ad; DCN 

CPI: B.04-C02; B04-E01; B04-N04. 
B12-K04E; D05-H09 

KP'I: TOl-J 

UPTX: 20010107 . 
TECHNOLOGY ^mCUS - I HSTRtMFl ITATION AhD TESTItlG - Preferred Process: FD is 
particularly a prc.bability density function and mass data ar.. 
^,:^e;ated computer or, for hypothetical (1). by mass spectrometry. The 
method IS used to generate 1-lClO, particularly l^'''^" ^ ^ : ^^J^,.. ^ 

dPfines mass data results from chemical degraoation .f (I) - oai t icuiai ..y 
'n'ymaUc aggradation with trypsin), including the effectiveness of the 
degradation, 'or a particular accuracy for mass determination, .he 
'omparison ;f data is Umi.ed to known (T) within -l-^^^^^"-^^;/^ 
.^■th-r mass or isoelectric point, especially to known _ ( I ) th<.t are 
".-idified e -1 for proteins pes t- 1 rans 1 at ion al 1 y mooitiea. . 
...diLieo,^.! _^^r P h selected rrom the 

C'-'n^rate hyr-ot he^t ic.a ^ i j , at i ..d^. l . - --w ^ 

tit.;bas^ fpr«ferablY in the mass ran.ie up to X MD) , n a.s aat . generated 
^d tho'-Cr^'^.es selecte.i that correspond to ..t least one constituent part 

the'no^n at. The procedure is repeated for different (H un rl 
sufficient masses (particularly i-iOuO) have been se.ected^_o alio, 
generation cf hypothetic.] (1; whicn consists o. a < t^^...s....ce..t 

carts that is different frcm all sets or constituent:: .n l-^-'" 
'^elec-i.-n of ;i) and of the constituent parts is ran.^cm. trajrr.ent n.ass 
t.tahna. alC be collectec for at least cne constituent part of Known and 



b vnnth*^- - .-al fT), and in thiL ... -- . . ^ 

da?;'and*E.ythen defines the energy used to generate tt---; ff-^' -^^^ 
..^^yr-i a sicnificc-.r.ce level is chosen that represen.o a le^,el of 

:;.onfiden.;ei;; Identification .:f (I), tnen unhnown * ^ t '^^^-f^^ ,f ^^^.f ^ 
constituent parts (especially in a predi .table ^^"^^"^^ ' / '.'f f 
■generated for these parts. These data are then compared -ith data f cm 
known (I) to generate an which is compared with an rD uf scores, at EC, 



case the C'jmoarisC'n i. 



:ween fragm.ent mass 



4 ^^^4 01 



that IS related tC' randoiu identii icat ion oi (1^ arid associatea v;ith a 
crLti<:al secure corre.spondiriq to the s i gn l f icari^ :e level. The unknovm (I) 
may be a mixture. The fraament msiS:: data ire ci::quir-":i vibrati'jiial or 
electronic excitation, spM-c i t lea 1 ly collLsion;; v/itn el ect rc-n.;- , phetc^ns, 
qac molecules or a curf ace. In tne method of (3), frcm an unknc^wn (I) is 
C'-mpared with an FT', a3C':'Oi.id;eo v;ioh a f>reuetermin*.;d 3 1 gni f i<;:ance level 
an^i generate':! t-y ccmparin-^i nuiso data 1 rem -inov/n and hypothetical il). Any 
S within the ciqn.i t i cance lev-1 (\;r.ich p.ar i icularly der.c.tes prol.-abi lity 
tr:at 1 -ienti t L :at ion '"t m) is laVsw- is de t:erj:i ined . 
Preferred ['hi t ial s : (1; ate proteins, nu:leic acid:;^ or 
O'!' Lvsac'::har ides . 
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■li^veLjpiuent , using f ro-:o-ss>jr correiat-j data re:;ar:d.ny e.g. ■::'el] 
r-opulat ions, soluble factor levels and clinioiL parameters. 
Ck: B'M :de J04 S'JR TOl 
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AB Wo' 200(1(55472 A UPAR: 20O(il^C0 

N'.'VELTY - A biological marker (3M) ident if icat ic-n system comprising an 
integrated database containing several data categories and data from 
various organisms corresponding to the data categ^jries, is new. 

DETAILED DESCR 1 PI'lOtJ - A biol':)gical rtarker ( B:-] ) idenlx f ±cat^on system 
(::ompri ses : 

(a) an integrated database containing several data categories, i.e. 
levels of cell populations, cell associated molecules and/or soluble 
factors m a biological fluid and inform^ation associated witn clinical 
parameters of an organism; and 

(0) data rrom various organisms corresponding to the data cate'-jcr ies . 
A processor correlates data withm the categories, to ilentify the 

':ategory (or categories) indicating n^trmal bi'tlogical or pathogenic 
processes or responses to drug intervention; chis category ( 3r these 
c5:tegories) is /are BM, 

IIJDEPENDENT CLAIMS are also m^-ludeo for tne fcOlowmg: 

(1) a method for identifying a BM fcr a o'iven oisease c^r medical 
cc^ndition (GD/MC) , C'lmprismg ct^r r e iat in^g inf urmat i on associated with 
several categories (as in (a) ab'Dve) frcm several organism.s, at least some 
of which have GD/M'.: and identifym^g a data cntegory (i.e. BM ) by which the 
presen'.:e of 'OD/MC can by lOentified; 

(2} a f'henotype -A an organism c-T^mp' r i s i nq several bloFtgical 
p-arameters, i.e. the results C'f at leas: lOO (preferahiy at least 40) 
^ssa-/s relating t^j coll io:r.ul at i ■: ns ar.j/:r tell as ;: ■:-ci E:ted ii.olecuLes, the 
results of at least 20 (preferaoly at Least 40) assays relating to soluble 
factors ana clinical parameters; 

■;3) a pnenotyp'e ^-f 3 class ji suoolass :f organisms, cc^mprisina 
oaramieters as in (2} fir each m.emrer; 

t4) a system for ^rr eating the phenttype of an organism, inv-tlving 
ibtaining parameters as in (2); 

i5} a method for evaluating *: he effect of a perturbation on an 
■trganism (or on a class or subtlass of organisms), involving comparing the 
information in the fhenotype, as in (2) or (0), of the organism{s) before 
and after the perturbat icn; 



a system for the i d.nt il icat i..n :t BM's ,t a 'HC^J;- . ^^^^^^^ 

0. t.J..nxn. If a BM-.n an .nx..l " .o:ie: as xn xs dxa,.osr.c or 

pvogno.tic ot ;::i/MC a h.man; ,n..'.olvxn.:, treating an 

^ 5:;:? :S : a:'n<rth.- 'ffoc-: n . m as :n (.); 

3nr.a. n.oa^-r ..... . ^^.^ ^^^^ ^,.^3,,,3 , ..udy xn nun.ans 

;Gr,;,c:' involving x..:ntifyrn, ' . .n a h.ran ..x :n are nc.uox.,. 
Bir. x.i.nt:f.ea in ,.:arr ai j-:C aei. t ^^-^-^^^ ^^^^.^^^ j^^^. ^ cD/MC in 

h.arr,an:-, xnvo.-.'ir.g x.:lvi:t 1 1 y i ng h-i:n.: B.. y,-,!--:; ll . ^ elevating 

1. d^xUi^I.^J --i.'^ ^ .... - ~ ^ f t p- rii-Tinr Pt-I; 

;^ ,n-^ - t-=>- denti tymc; .iM:ai^i t-ju^im ^ ■.^■/-^, ^ ■ - -- 
^''^ / ^. r-i- r-"'tent ia: animal niodel^ and an 

;.:J^^:i::;\avinriK aVMCr^ridenti most ..los.iy si.uiat:ng anx.ai 

,'^^^,r.r;-:-n .^;^;^Tnnr?,r^.nxLl"lnl;x;ing comparing .nfcr.aatxcn 
'^fpH:^.-,^i;:s1i; xn ::))dor tne qen.tx:aUy altered ..r.d non-alterea 

,rgan.s.> t.- xder.t.fy ■-'^■^^•■■.r^'^ ,,,, deve l.opiaent . The 

,0, _ The .y.-.:n. '^^^^ ^■^'^^ ;^ ^Jcll 'r^,.:,-,.,, p.altxple sclero.xs or 

htnctyoes n.ay r. of hu.ans, anim.is, "^■^^4:- 1.- '^Ue^ai on 

' -,i^,c. be u?ea f-r evaluatxng tne evfe.to jx a '_.^.i.-x.- 



they I'' ay a_.; 

cr, a rdant or anxr.a.. _ ^_ ^ ^_ auantitatxve, 

A!:-7A::7AG^ - in- te.n:. .-g, -^:-y;-,:;^^;rd-,,,^,^.ie'and diverse 

" The biolo.qical parameters ran be xdentxrxex trciVL 



develop-iT.erit is p'jSh- icd- 

small samples of blood. i . s.-r^-mat ic representation 

DES'^FIPTIMII OF DP-AWI1:G(S) - Tne li'^iui^ lo a s...n..ma lo p 
... .V ... ^nf,-.r.,a-ion assir^Hated to obram a ciologicl maik.r 

i iont if icat .I'on system. 
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data for levels of ..ell P^PU-tx.ns and/o ....^ ^^^^^^ 
obtaxned by^.xcrovo a.e l..er s^^^^^^ 

soluole t.zi'-L'^r^ ^ir- ^ui.^) . 1 -ninunoa^savs or niass 

... .-^^r-H h" -i-ro^^'-'^ urn- Laser scanning .y.'^'m^.ry, i.nmunoa^ . c:. , 

(jotdiri-ra oy — ^ - _ . 1 ^ - op^riotvte information. 

,,f.ectr,«».y. Th..;._.H-.^e=y';.,»xe,; „ „ s . J.-^ ^^^^ 

cateqori-ation of ■■;d/MC. 
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a;fining a set of all theoretical target surraces. 
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i.n r-:-:':: ibl--? Earqen '.i , -irir; "iefini.ng a i .il-^-V 'inuinerat oM i^et of 

r neoi'ei ic.i L target u i: i: -I'-e^ lincj^f t:he (.lefm^K: i::on:sM axnl j , .io Eh^E eacli 

■■ ha3 a d^rfiri';.], :r i "I'l^. vclui'ie <iri<"i ■! cie f :.. rif^d^ ^ ic-rr :.:.;.ioa3 surface 

-":re"i, 13 new. 

EETAILED EESEEIETIOM - A :;ipuEer-fca3eM :':eth'-):i cc^mr 3 1 [!■ i d';^fiii:rig a 
3er .ji: constraints p:>3s:r;i^i target 3urfa3e3, an i defining a fully 
enuri.-ia::ed aet ■: f t r:ooi:e t .'i-a 1. t-^rget surfaces ai.d*-: the ieimed 
-cci3traint3, so tr:at -aon 3ur fa-re has -i define'-!, cent Inn -rjs v'^xlume and a 
'i'^fme'-i, oontlnuoas 3nrfuee ar^-*-:!, is new. Tne inetrMd furtner- eompriaes 
ii-inoir:g sets of oEni-'ts • j tMv- lully enumer it e :i set :^f n^e: tt-t icil target 
;.-ur'f r--. l: d^ :M n^ ' ■-.e: r es r:o ndi r . :^ sunsets if 1 he fuily enune ra ed set of 
me'jr^-^isil taroe*: snrfieis, m-i analyj^na an i;-peot if: divei.:::.ty e-f the 
l^j^-:'rs rased --n ae,;t':-rs ii s .ai_ I .ir _ t _ ^n i .n. :r f e varvies an:n:j the 
cia:resf"inding suE.sets. 

An IIJC'EPEEEEICT '"^EAIM s il^'i inclutleii :c a eiii:i:^t-r taijranrr^ed to 
,n.f;i.>-n-n r:-^ tne oherrnoil sirr.ilari^y of d^ff-ren: i!io..e:'^ le.- , i h- pri-grait 
e r^rr pr : i ng : 

nitu:' :ii i it: : n .1 t:";e sis!E:":-5 shipe :)f : I*--cu Le-^ i: int--:i;'S*: tsy 
■i:M..n.j 1 seri'.^s sates, i i :h t-iviri;: a :iiriu-::s Ion E, zir^ E:>'; -it ii^ns I'f the 
eaces^'elng :ieterni:ned oy the -.taliulatel ele.-ttL.n or toil: i i t y density of^ 
the mdlviiiaai mcdeoui- f int-'^r^st, *:m ::h c^^L— sharm:; ..^t ie;ust jne of its 
SIX fares with anotner suh.e, s-j that there Is a sperlfir r.uriher of linked 
cuz-^.i3 which varies for each rnolerule of inteiest; 

[b) approxirr.at mg the ■rnemi'ral reactivity 'if eich incMs/idual molecule 
of interest oy assignirig eaon oune of each In^iividual niole-rule ol 
interest, no more than one functionality value frt>m M different 
chemical functionality values; 

(c) approximating the surface shape and chemical reactivity of a 
chem.ically active surface navln;i a volume equal to V iDy subtracting a 
number V/R3 cubes of dimensiso F from a surfr.ce, where each oune space 
shares at least one face witn an.ither cube s[.ace an-:i where U cute spaces 
have one of M different ■rhemical functionality values; 

calculating ari attraction value K f':r each molecule of interest 
ts the chemically arrive surface; arid 

{e) calculating a list of i.verall attra-rtion values to the chemically 
active surface. 

:;EE - For analyzing mcd.ecuie and protein aiversity. 
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Etl: St3-E14HE; SOE-^IEE; TEl-Jt-A; T01-JC7B 
TECH t^FTE : ^iXittlEi:: 

TErHEIMM'GY F^jC'JS - E T tiTEGHM' 'LC A:' - Ereferred [Materials: Tn^- -aroet 
surtaces comp-rlse n^^:rati';e space target surf^i-es. The cnject.-; cc^mprise 
O'-sitive space :4:'j-c^ surfaces zissociateo wl:n different molerules. The 
oE-ects^ comprise p..-sitrv- sr-ac^- ibjet: surfa^^:-s assC'Ciatec with different 
molecules. Tne oEjerts a:^ mapp^:;C by oeflnin-i c Dr resrt: n :i i ng sufc.sets i)f the 
set of neqative sp:;r- t h-i : reti r i target surfioes 1l whir;! ir.e pt>sltive 
scuice ot^ert surfa-r--r3 :f ronf c-rmat i :ns of nioiecule^ -re t :mc I ementary . The 
aspett of ■:iiversity tn::t is anfjlyzei is the -iif f eren::e :r si:tilarity 
between the m:le rules, which rrep to those ne-:{itive sp^.ce the':retical 
target surfaces. The oty;etts cc^mprise negative sp^ace otgect surfaces 
ass(-ciated with different proteins. The objects are mapped by 



detming ■ -or resp-:'riair;q ti-:;:^L- l.;^ xu^^^ . ..... ^ ... 

space the.reMcai ta] uet .:..rJ . -...^ to which the r.cqati-- -pace (bject 

swrfanes ■:■ t i-.he protein f ■ . rcf ^:ie 3:_rni]ar. The .ispe^:-'. < c: 

d.ver-ity that is arialy-^:.i is r ne dif:!erence ■. r ;:iniL::-i-y t etweeri 

protein p-ckets, wnich iii-M- to t ncse negativ(> sp^^'.:e th^^-..ret. icai 

-irg-^ 3u:rves. Al t -i n r" i ■ y , the -ijec-t::^ c-;.mprL:;e pcsicive r-F^ace object 

siirfa^-BS a;:3-ciatK^i w—h crf-ient rcleauie;; and rK-g^Mve sj-ace c^h:ect 

surffices iSaoci.at-d warn ditr-ient proteins. The ■.)h j e^ t s 

-^■jmpr' p ' " ' '^e apace '-r^ec: p\u ta-.-es associatr-i w i • . r. d^tiei:>Trd. 

^^..,1^. ,;n,Vneq.t.iv.. C|.:e nert: .-•rfacea ..a-". -a--"! w-:h d.ftr-rerit 

proteins. Iri the case '^i r:.- le 71.1 ] es , ^d.e <:b]e:c.a .rs-.- :n,i;.f'-d by 

a-.-! : ri..a:j — i-ir.J --^ ■■ ■ ' .-.-^.^ --i-^. . - ^ - ^ • - ■■ ' ^ 

,j,a::e -he, lexical tare.- a--:. .a- to whi-h tc. F^CTitic-. cT.a-.'e ..b:-er:t 

.,jj.f^^,.c. r.r .-orifcrrnatisa.s ■'■t rcc-'-uiea ar- '^s-np. '-itiea t-. :, y . ^n ti.e CaSe of 

proteins, rn-^ ai^e^ta ai. \.y :i-fir.ir:a -aai r-sp :c ,d:i n,i auo..e^:r _ 

: tae tu.clv -nar.erated s^: r.i aeaaai-e sr.u.:e th.-vie::caL rorq^t aari:ices 
r,: whicn r.^piMve ara're a.rtaC'.js f protein p;veets are 

-i'a^'!a>" Ta^- r h^-.ret i^va i tara^t sarfhtces coniFaaLse polyriedrans . The ob]ects 
-.-.mp-is- r- 1 yh-arans. Thv pciyaearona ro-prd.s- cubes, whica are aii L-t the 
■iaiae size' .ad shape. The s.t of all theorerical target, surfaces d.-ines a 
diversity spa^:e within w:.:ch tne diversity o! obje.-:ts can oe rneasareri oy 
y--ir>fin:i tri'Sse iec-. s ta- div-'jisity space. 

p;;.^o>:::ea [hrt h.svTS : T:i-r ^s . in:iu':ies i -ena i :y :^ag re';T..<.aiS oz tae 

ice tS' wnicn n''. .d.-ie-.c:3 rap. Tae i;ie*::-iod <;i . so irrj^ades 
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gai^ri'.^ a ■■id'i'C- i.e s an.:i"_ . .:ap_. 



t^r.jedsuLfac.it'ofahed..v.T:^hy sp^ir-a. 'Jonp-.m-nt an r y 13 ,..s.iooia-;ed with 
tmoma .atrinirie^ -f p::/.*.--r ^^uice ofc--7- :.ut:f:iCos -f confirmations of 
r' --I- :ule? t n>:-':i.--ir ive -rc-^ioo -.n-o.r-.-ci : dl tGf;f. .?. ta^r^^s . The .;ori3t ra LP.ts 
o'rir.rise ^oIj:^^^^ Tn^i or S- 1 i:air.r ; ;orr:oriso as- : c ; at oach .OLto ot the 

taiaec 3.i.race v:itn s r r e ; /.^d-:.-;. .j l a. .r....... .^a... 

. ,^-.^l..^r.- i :r...:e-:.a:ar -:r:er^i-s is :rawn ir-r: . L.-.ier set; a.: ooss .o.e 
irc.deculdr pisp.itiea. e , j . hydropastir, p: la r i .at a e . H-bond accep-3r, ^ 
H-b.-'Pd d'-a-ioi, H-t:on-.:i ds.ns-r / acc-spt^T, cotentiaiiy a^::^ 1 1 .v- x v ..h.^rg-'.. -.r. 
p.-.r^ntially negatively ch.ig-:l. Not all of the sites of the target surface 
are each associated wi.rh a different one of the molecalar properties and 
a'l '-f ta-^ ca-her sites ot the tar^Tet surface aie asst-ciated with a ccimon 
molecular 'property. The corrjnon molecular piof:-rr'., i. slight 
hydroch-obi-ity. The decrees of similarities or di f f er ^.nces comprise 
functional properties associated with the cor responai ng subsets of 
rb^ fuU- --lume-ated set ■-.f tneoretiral target surfaces. The degrees of 
similarities or differences comprise shape properties associated with the 
sorrespc.ndmg subsets of the fully enumeratea sec or theoretical target 
surfaces. The method further camprises defining each .-.f the oojects by 
quantifying molecules into polyhedrons. 

L123 AKSWER 4 'JF 73 WFI/: t-.tpvpgcHT 2001 OEF^WEUT INFORMATION LTD 

AN iJCQij- (MS^'f [04' WPI/' 

DNN NaCOC;-4^?U^:3 bNa i.:aO- f 0 1 a 00 

TT T.ientifvin^ nove:L nude: : ucid molecules encodmc proteins of 

,r,teiest, and natural l^naea je rutcessinq ano extraction ot relational 
mftc-mat ion associated witn g^rios anci proteins foun^i m i.:.>urnal 
■J It : cles . 

DC b0 4 DIO rO':- TOl ^ 
IN FKIFOMAN, C; 'KALACH IPvry , ; P.'rA., P; KI-ALTHAHMEP , H tc P:HET£-Fi, A 

PA yiV-"^(:); KdlV Ct'LdMPTA NFV; V'sPr' 

CYC 1 ■ 1 ^ 

PI : uCi'd-ir aia^-v A1 _C'Ohl::C.. f Oa::.0^;.4 } - FN .v"4p C';lM'C-d-aO 

■ RW:'aT^BF CH CV DP DK FA FC -1 FF CP CH CP IE IT KE FF FN KC KW NL 

OA FT SD CF 3F CF TF CC FW 

P^ F' FI 3F '3^ CF 'I'A HP Ht: It IL IF iC IF FF r:C r'P r..^ ..C ^F 

bP FF LT Lti LV MA bv:^ v/:. MF Mi: :':w MX N ^ IJF PL rd PO PU SF FE CC SI 

Fr' FF TJ T!-1 TP. TT TF -FA ''G FF VN FU Z- Z'F 
/a- FOO0F4>^t^;. A a tO:;' 1 1 ^'^ '^a-)I-7. Ifi IN ■ ■ 1 - 0'- 

ADT a0000..:-b37 Al WC- FOCO-FFIcyjj f::VJ0414; A^: Flu004--t56 A AU F000-4355b 
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FDT AU 2000 "'43556 A Ba^;eci on WU ::0u063bfr7 

PRAI US ;'.:^9'^- 327 933 19^90608; n.; 12^^:^9- i 2 94 6 19^904 1 5 
IC I("M '3IJ1N03 1-0 0 

ICS G06F015-00; G06F017-00 
AB W) 200O*;36F.7 A UE'AB: 2000120.-^ 

NoVFLTY - lientil'y-ng novel nucL.fA.c .icids riolec:ales encc-dmg a 
protexn of interest, using regulatory nel.w-)rko, new. 

:)KTAILH:D DESCRIPTIOII - Ment,! ty mg novel nuclei - acids molecules 
eMC',)dinj a protein of intereat, using regu:.at.:ry ne-wcr-is, is 
n--jw. Trie method crjr.pr 1 se.s : 

(/a) sele-tinq a specitic protein fr<jin a specie... involved m 
a r-!gu:itory netv/or^: of on'o^res-; 

(£:■; i-:i-r-ncif y.i.n':| rcnowri proteins thao :n:t: :.Y.s^:re-:a an:i 
downstrean if the protein, with:.n -he regulao^ry ne^worK; 

strucoi.ng the re r-i^ :i t(ji y n-t..:.,i- ■ - ^ 
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proteins i l-ntif-ed in {i: ■ ; 

(■::; for each iientitieo protein, sei.'-cting a d'.iiniin or 
mo-if 'and^searching by homology for related proteins in a^ second 
sp'-cies, a related protein has a hci^ilogius dorr.ain v-r motif; 

(e) producing a regulatory network for the secmd spec.es, which 
incorporates the i :ient i f ievi related proteins; 

(t) C'.:iiparin J tne netw-irks ot tne tw^: spe.iies; 

[a] Identify mg a protein present iri '^'niy c-ne tne 
netwc r f:s; jrid 

(h) isolating a nucd.ei';- aoil nijlecu^e -:!n:-:iaing me protein 
identified in (gj m the species in which it is misiing.^ 

Iht'EPEHBEIlT iZLAIMS are also include 'i f:r the r-:. llowing: 

(1) identifying the el feet of a gen^-^ knocriC'Ut <■ n a regulatciy 

^■"""(■) Identifying the sh-L-rtest n:n-or:enrei pathnay connecting two gene 

^'''^'"''^(P) 'assigning an initial sign value of K.mus t ■. the knockout since 
the knock.jut gene is inactive; 

ir) mnving alona the shortest patnway between ^.he two gene produces 
multiplying the sign^ith the sign of the next gene product m the 
rarnwav, where minus stands for inhibition and plus stanas for induction 
or activation and zero stands for lack of interaction between two 
proteins in the specified direction; and 

(d) d-termining the final sign at the end of tne pathway, where minus 
indicares inhibition and p'us indicates induction or activation of the 
pathway; 

(2) identifying a novel nucleic acia molecule '.-needing a 
protein of interest, comx-rising: 

(a) selecting a gene of interest and searching a database for 
homologous sequences ; 

(o) aligning the sequences identifieo in (a); 

(.-.) constructing a gene tree using the sequence alignment; 

(d) constructing a species tree; 

inputting the spe.-es tree and gene tree into an algoritnm wnich 
:nregrates\he species tree and gene tree into reconciled tree; and 

(f) identifying orthologoi:s genes pr^bserX m one species but missing 

J another; 

(3) identifying a nr.-vel gene, :ompr isin':: : 

(3) d-M^ininq a motif or domain corajzos it i on of a gene of interest; 

{■-■g s-ar-hinq for sequences which correspond t ) nucleotide sequences 
m an expression sequence ta:] database r other cDhA datal: .se using a 
program su-^h as BLAST and retrieving the identilieo sequences; 

(-) s-archinq additi:nal databases for expressed sequence tags 
-"^ntaining^the domains and motifs characteristic fir the oene cf interest 
wi-h a hidien Markov modei of domains and motifs identified in (A); and 

ii) ioentifying nucieotide seguenies comprism? the gene of interest; 

lA) extracting information on intera :t ic-ns between biological 
^n-ities from natur al- language text data, comprising: 

.'a) parsing the text cata to determine its grammatical structure; and 

[t] regularizing the p.arsed text data to form structured word terms; 

and 



\ ij ) a c oiiip u ti & r s y ate: ni i o i" 'A L. I' t i I i y i j. i u t !■ i ci L .1 r l o 1 1 L - 1 l- 1 o g 1 c, u x 
entities fr^uu natural - language text aata, ■ : :^np^r isin : 

(a) means f :-r parsing tne naturai-lan ruage text data; and 

(b) means f':.'i' r egularizing the f.arsed :ext dat n term structured 
v/S'rd terms. 

Uorl - Fs-r i':ient i f y rng novel genes .-in'i f jr natural langua':je processing 
an'i extracti'iTi ot re 1 a t i-jnal Informaticn a.:.s jciate i vji:h genes and 
pr jt^^ins that are fc-und in gen'jmics j'l arnaJ 3rri::Ies. 

Ai'VAiiTAt-^E] - Th-' m-'tri'jd dll-jws th- reipxa retrieval of In f c^rmat i on from 
literature and m ^nir^u 1 a:: i.'ZTi of ceriveo fun'*:i'jnal d^t:i, removing a 
r ese.^rohers neec to r^erf jrm 1 aoc^r i'l-as ce=i0^n;; and :r.ana3il integratic-n of 

L^wg. :i / 2? 
FS Cel FP: 
FA AF; :F'N: 

MC f:Td: B'j4-F01; B11-':'0F; B12-K04E; D05-H09 

EE I: :v:::-FO::^; Tvl-; 
TECH oFTX: 20001209 

TECHIjOLC'GY FO'Ctt:: - PIiOTEOHM(JL(OGY - Preferr-d Metnod: The nuclsdc acid 
encoiies ^ human protein. The related proteins are 

■'.■rt h'l'logs . The regulat'jry pathway is involved in apS'pcosis. The specific 
protein froia the first srecies is invcdved in tum'jr sup^p^ress i <: n . 

TEOHIiLJLC'GY FijCUo - Ct'MPMldMG AKIj Ct'NTriOh - Frelerre':^ Metnc-d: In the method 
^: f {!), "liie f':vl[';v;ing -^Ig'.ri'Lhm is u.:e':i tc- in:egr.::-Oz the .rue^ies and uene 

■a) ooiup'Uting -ihe -irid Parity s igma { 3g i , os j ■ fs-r each psir C'f inferior 

r./:-des frsm treei Tvf and Is; 

■■ t ) i i na i n g t h e ir.a :-: i mum s i griia ( S g i , 2 s j ) ; 

;-Tiving ;rg i -is ^ new :h.;s-^er of or t h^d ,:..;r.: , saving {"gi)-(,?si) a= a set 
■ : f sr:-:-ci-:c tnat "1 liK'^il;^- zo nave a gene '-'f rhis < i nd (o: i^-st it in 
evolution) ; 

-liniinating F-ji frc^m Tg, lg:=l^^/Fgi; ana 
(e) rer^eating steps (b)-{d) until Tg is non-empty. 

The methC'd of (i) further comC'rises using eacn ide:;:ified expr-es3ic>n 
sequence tag to search S'E-guence databases f'l-r ove r i app) i ng se^ruences, to 
-uSseKiC'le longer overlar^ping s^re^.-hes of oV.A. The ::i-:nod f (4) I'^i Lher 
■r.:.mc'r i ses preprc-ress ing, the data p'ric^r to parsing. F rep-r.: ces s ing 
■:-omprises identifying biological entities. Tne method further comprises 
referring to an additional r^arameter which inaicates the degree to which 
subphrase parsing is to carried out. I'he parsing step may coiuprise 
segm-v-nt ing the text data by senten-i-es and segmenting each sentence at 
identified words or pnrases, or at a P'lefix. Alternatively, the parsing 
step* comx.rises skipping undefined W'jrds c^r i'lient 1 f ying cinary acti'jns and 
their relationships, ana identifying argumencs associated with the 
actions, the meth'jd m.ay further comr>rise perf'jrminj error reacc-very when 
parsing of the ''lata is unsuccessful. The error rec.:ivery comp'rises 
segmenting the text data ana analyzing the segmented data to acnieve ar 
least p'artial piarsing 'i-f the unsuccessfully r^arsed 'iata. Tne tagging step 
•i-.C'mpri ses provi-lina the structured data c-jmp-icient m a otancr^^rd 
Generalized Markup Language iSoML) C':n.p'a t ird e f'jrmat. 

t'ref-rrea System: The systeni furth'E-r ■iMiTLt) r : s*:'S mear-iS f'jr p-r^rpr c-cess ing the 
'"iata pri I'r to parsinc, and nieans f'lr refer'rin; tD an a :id i t i^: :a a 1 parameter 
v/hich indurates the ■:ie?ree z.o wnlch s^liphr-i^se is to be carried out, Tne 
pEirsmg means furttier- ce»mpirise means fir segmentin-.j tne text data into 
sent-nses an:i means f'lr se'irmentin^ the senten::es a*: iientifieil words, 
pnrases or p^refixes. Tne piarsino' means may ::imprise means fiu' s^ip'p^ing 
undefineii words, means f'lr identifying binary actions and their 
relat i anships , and nieans f ir iientifyini^ arguments associated with the 
actions. Means fi'r pier f o rnd.ng error reiovet-y v;nen tne parsing is 
uns uc rress ful may also oe included. Tnese lomprise means fiir segmenting the 
text data ana means fc-r analyzing the segmented text data ti^ achieve at 
least a partial p>arsing it the unsus sess f ul 1 y parsed text data. Tagging 
mearis ci^mprise means f:ir p)rjviding the struitured data com.pment in SGML 
ii'moatitde format. 
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PNN ti. lO- 4 ■■ : 4 69 PFC C. ' 0 Q u - I 9 3 1 3 4 

TI Lifiat.i.r.q property va.Lu^r.:; cf .^e^iuences P-y tne sel f :'?rre 1 at ion of the 

pr<-.»p^:er ty value/;, tP^r pre- 1 i -I't ina protexn se<::oridar y strooture . 
PC fv'4 LF<^ >TP4 TOl 

FA F.'FIF-in PEIBiiTFU PLPIFHL Kt)(:;;A?'F }-:EM}-'YFSH0 YY 
CYC I 

FI FP poF"^073 7c A PPOoOVPP i.iOot.'j_i^ oiO -5 F^i IP - L P 

APT FT yV:)i!jp7'=7^ A JP 1 ''^'-^■)- K";^P 4 IPF;^''!]-:^ 

FFFaI FP 1 li i'^F4 1-9:^0 Hp 

IC fFI G06F015-18 

:rr. G06F017-18; G06F017-50 

KJA i :^ ■ /i\ ' 1 4 -i'-^'' ; i PFF ^ ; - P.;. 

A3 FPP^'-'""JPP7 A I'PAP. : .FjJi 'Mj'j 

li(;VFF.TY - A mePhO'i (141; L^.r e^.t iiriat : nq p'rc^fOrrty valu^-';:-, by the 

o*.F r -o^^r2:>■■l 3: t .1 n --A ^ :st iV !' '?]:; ert •/ \'a^u^:-5 wr..i':h oep.erni-r--E- t:;e 

L-^har.-:i'-:t en s : 1 O'f the n.r.'ure c^f a (.■riL^-dinienoi'jnai s;/irLp'i:-L string when a 

t^'v; ot ^ne p-aiticuiars are p i^i'::e':i i r: ■-equeriL-e , is nev/. 

PET/'.ILED L'ESCFI PTIfjF - In Ml when the r-ne -dinien/' L'l na 1 symboi stiing 
.:.s exp'Tessed as a sequenoe string ae]jending on me prcperty values, and 
tw<:^ syntH:.. strings and property values ar^ giv':n, tnr-n tho picperty value 
:s charaeterizea r^y expr ss i ng tne s.i idi 1 ar i t y ot the sequence 
profxle as the LFC 't^-'pstiuir. d.:stano^^, ana estirr.ating it f r "Mn the 
i n^j-diiu^:":::. .e: nri 1 i'/iix^jl -tiing ^^'n^.':::: l-iJ'i Y^i- 'i-n i tr uc:t ^^P li^iu /yrrJjols having 
u n P.nc wn p' i c-p' e r t y v a 1 u e s . 

An IFCiEPEFFEMT ttLPvIM is als^-^ included tor est.in...t i ng the 
se^-cor :. ^iat i'. n the f.u -i^p^e.: l valu-s whJ-ih determine the 
characteristics oi tPie nature '^f a '.■i\e-diiiiensi':ni ^i syrrx'cl string when a 
:-v; the p-articu^ars ai-:- rda:e':; in sequer.':-e. Tne ir.^'S" ap'r-r^jpu late 

•■^x_^^r^.': \'alue ft-r : n-. u-t er nimat i jn Lvf the :'n ara'St^i- : 1 st i :s i^^s nature, 
IS rc-und tr^MT tne r.ativ'^ se<7Uen'::e, \/here a tew c^f t.ne p^articuiars in 
sequence ccTiSt itut*- tiie ■.■n-'-d^iuei .s i '.u':..!! s^'iilP-iiT sti Ln':;. Thic is done P^y 
r' mding the tine sequen.re p>atterri s^etv/een the one-d inensi tria l symboi 
string, and syir.tois v^hrch artificialiy shuttled these randC'inly, and 
*-:-:pressing as a seojjence string t'f the shuffle sequer.ce ft^r the property 
values 'cf each C'li^-d^iiLenw i'-^riai s_;i[iP"-d stiirig ct'nsist^ng' C'f S'L^quences 
having certain prt'perty values; then while comparing tne similarity, by 
expressing it as the LPC 't-epstrum distance r-etween tws' syiabol strings, 
then estimating by a genetic algorithm so tnat the LPC cep'Strum distance 
r-etween the native sequence and the shuffle sequence piecomes larger, and 
estimating its p^roperty value fr^jm the one-aimens iona 1 symboi string which 
can be C'jns t ruct ed from symbols havma unknown p-roperty values. 

FoE - To ■uetermine the properties of amint' aciti sequences of 
proteins and the h-ase sequences ot aenes. 

ADVAIJTAGE - The amiiio-acid moex can t-e estimated from the sequence 
dfita only. It is similar to the hy't. rop'hC'bi-t index, giving the influence of 
protein foldin'4, ar;i the differences in the three-dii'Lensi'tna 1 
structure of the protein when th-vre is a g'j-jd ■i-E-ai -jP se^j'^ridary 
protein structure. 
Fwg. 0/;i 
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TI PFjfiitoring di f f ereiit ial expression -jf ger.es in f i lam-nt tjus fungal cells 
ises f Pu'jrescence-labeled nu::iei": a ::i :is isidatO'F frj::i the cells ani a 
sutjstrate of expressed sequerice ta-:is. 

P: B:.)4 '7u6 Lilo TOl 

IF F:^EPKVF, F M; FFAFSEF, I F; IFAFPPIFEF, F; OFFEN, P B; F.EY, M W; SHFSTEF:, F R 
E'A iFFPiVt)) FC'VO NOFX)ISF AS; (F-jVO) NFVO NFRDISK BIOTECH INP 
CYC Sf 
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W'"' -'Or)'>i"'62 A UPAh: 2 ^A' ' 1 L •> 

K.-,.r^.v'- N.,w r.etr...:i Am non t Mn;.; ..:::erenn.n ex. :.on o.: genes i.n a 
firot r L.amenAcus ijiru-. (I'^K) --I^ -:li*:Lve e:-:pr-Y . :m :.n or trie :^arr- 
G-n-s M-i on^ or more se-o.-riYi 1: i 1 .::nen- ouo f anga colls JSes 

f La-jres .:ence- i.uDe x^a nuY.^ei'.. a .. i ^ ^ .jh 

e.jKcji- ra^-*^ of expressed so-'^aence cags (2:?.T). 

'dETAILEA DESCPIPTIOn - Mecoo-i for monitoi ma di f r .-ent i.l expression^ 
or g.^nes in a -irs^ f : lanv-ts.us tjn^ul (bD rel t: i v-:; t:o expression o. 

tne same genes in :.ne iroie s.:::on: b F l.o- ooi.o : o- .s : 

(1. ad^ina a r.ixtare of f 1 .ores :-:-e-] aL -r ieo nu :o.. lo^.oi^s^^isc ^a.ed 
f^.-rr, ^v ^^ p^FT -.^iis -CY a sacs: rate j jo : a ^ n ma 3;x array :l b exL-i a o-u 

seau-rice taqs (EST; in tne array, wr.er.r tn^ i. i-i . . . .■J.. i 

an.ri one or more seoond FF cells are lai:ele:l v;-.rn :li:::-rer:t: tirst a.:d 

second fxuoies.-ent retz-r-i-ic ^ . .,-.,,.ce exc-ation 

•;Y,i exar.iir;ino: the t/ t iu, i'::;<->-n .tr i.i.i i ..-j- ^- - ^ ■ 

coni^itions where rr.e i-^ative OKfite.-in °y ^ y ■• S ^222. Av,. 

a r I a y .m whi ch : ... 

(1) the EYTs .n the array that hycridize to the nucleic aciOo 
<-,btained from either tne first or the one or n.ore second FF colls produce 
a distinct first fluorescence errdssion color or one o! more secon-. 
fluorescence emissicT: colors , resp.e-.-tively; and 

(ii) the ESTs m the array that hybridire to the nuciei-: acivis 
,:.k.rained from both the fiist and the one or r>-.re second FF cells produce a 
distinct combined fluorescence erriis.-?ion color. 

riDEPEtlE>EtIT CLAIMS are also included for the following: 
(-) a -omouter readable medium which has recora..d on it an array of 
FF FSTs for monitoring differential expression o: genes in a ...rst tt -.eii 
relative to expression oi the same genes in rr- or iL.re seccn... FE cells; 

(2) a crmputer-basea system for monitor: r.g dif i-riential expression of 
genes m a Arst FF cell relative to expression of ti'.e same genes m one 
or more sec^jnd FF cells comprising: 

(a^ a data storage device comprising FF EJTs sei-.cted trom seguences 
l-7<^GfA(niven in the sceci f ication j , r;uc;eic acid fragments ot sequences 
i-78G0 or nucleic acid segu-nc.s witn at least yOY h:::;ology t :■ sequences 
1 _7 s . 

(A. a search devi.:e for comparing a ta.'Met se<:j-jonce to a FF EST 
sequence of -he data s-o;:..ge device ;f (a) t i.::enti:y homologous 

..equeri.--.,,^ r^rrie-aj. .-.-/ice for obfiining th-.- h-. molo r:-us sequence of (b); 

(2 a substrate ..-ompr isinq an array f BF KSTs r mr.nitoring 
d:fferennLal expression o: gen>.-s in fir.:.- ■• ■■■-■i^--'- 
of 1-he same genes in one or more second FF c---ils; an : 

(4; an isDlated nucleic alio seqienie r:mpiisin..: any of sequences 

^'^'2lgE - The ESTs are used in the riethods :or monitoring differential 
pvr^ession of genes in a first f i l..:r,.ntou-. fungal (BB; ceil reiaiave t. 
;a;„,,i-,n carrr. ::er,es ir: me or more sec-.nd : i .arr.ent :us fungal 

AAlAAiIaimea) . Mcnitorinc th. clmal expression ci ceres rr:m BB :ells 
allows the production p:tential cf the mien i.rganisms to he impr:,veu. New 
qen^s may te discovered, possible functions ..f unkncwn open 
reading fram,es can be identified and gene ccpy num.ber variatim and 
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otabiiity Lan L'e irioni tored . Th-r ^i-.nression oi geries Can be u^ed tc; study 
h'w FF cell:? acapt t :> changes ..n ilture ccnditions, en vi ronnental ctress, 
^■t.-:i e mor| '^egene^- ic , recromt inat ion, me tah'C.'. i. c or' cat -iir^o.^ i c: p.cith\;ay 
eri-ji neeririg . 

AC'VAUTAGE - Usirig El'Ts pr.'Virv;s severr-ii advantages over 
genomic oi randrm :L'IIA -nes in-:-; idin:; elLminati<jn ol redundancy 

': ne spc-C on an a: ray e-guils ^:-r.- gen^.' cr c^pen readiio;j frame.', and 
iM'.^hnizat :ron of the micrij^rrays ioiised ^jn function of tdie gen*;.' 
f 'i nu'^t; t facilitaT. e iir-i.iysLs r tne results. 
:;v;4.0/t 
FS Crd EPl 
FA /d- ; V'.l]'. 

MC CFI: Bf-'l-LC:.; Bn^-EC-^; Fn4-F09A; :^ : 1 -C'^? F3 ; E11-C35E5; B12-K04E; 

B12-K04F; cO-l-EO .:; Cc4--.'j5; cr-:-r'0^.^; C L J.-C07B :■ ; l;^ 1 -C:0 3E.:.) ; 
C12-K04E; C12-K04F; F'Ot-HO^-; D05-H09; 
nn^-iilO; D'^S-HlCA; T'")^ -HA. Ad 
ErI: TOl-d; T'M-d05b 
TECH n?T>:: 2:jC'"'U06 

TEcHr:t)LOGy FOCUo - BI0TECHMt)L'3dV - Freterre:! Sequence;;-: Trie ESTs are 
A jremc'nium, Asperigil lus, Fusariun, Humicola, Muc:)r, My :e 1 1'opnthora , 
rieurcspora, Penicillium, Thielavia, TC' LypiO': ladium cr Tr icdnoderma ESTs. 
The F^^..jr.aiu venenat.im Ed'?s are oele-oed fr:n. s-E-guencc-s l-?J^O (given in 
t.h-^ ;-p-.:-;i r ic;ati.'.)n) , nucl-^ic acivi : r: igni--:nt s ■: f so ^uc n ::r;'3 l.-F'/'O !>r nucleic 
ricid s^ouences v/itn ao l-ast ':^'J , pretorrably ^bt and esi::>eciaily .'9. 
h;:moiouy to sequences '7':i. 

The Aspergillus niger ESTs are selected fr^m sequences 7 7 ] -a 37 (given m 
the sp'O ::if i r^ati^ri) , nucleic ac:.d fragments -id: sequen::eo ?"^71-437('; or 
nuclei'::: a'jid sequences vjicn at Least ?L5, preferoiblv :'5:: and est:iecially 

. h'::'mo 1 ■ jg y t o s e que r. :>■? s 3 '/ '1-4 376. 
'-^^.n Asper.-;iilus 'jrysae EdTs ai- selected fr^^m 5eguen:>;-3 4 37 7-?4v^ (given 
.:;n the' specif icat ion; , r:u::ei- ;.'::.d fr-jments : se':iu>vn::es 4 3':^ " -'-hJ 1 or 
nu::iei-:: a:id sequences v;ith at feast 90;, ::referal:dy : am:! especially 
'.-'L-'A-^- n-jmc'l^igy se-Ti^rrjes 4 377 -74Cil. 

The Tri'::.h:jderma reesei EdTs are selected from sequenses 7402-7860 (given 
an the specification), nucleic a.::id fragments of sequ^-nces 7403-7360 or 
nucleic a':-id sequences with at least 90 5, ru-eferabJ y 95 > and especially 
99AJ': nom'.rdogy to sequences 7 4 1')3 -7 -3 60 . 

Preferred Method: The hyor idicat ion ccTiditions are very low, low, 
low-medium, medium, medium-high, high or very high stringency c.-ndit icTis . 
Preferred Substrate: The substrate comprising an array of FF ESTs may be a 
glass support with nydrophobic or hydrophilic coating on its surface to 
which the ESTs are elect rostat ic-illy b:jund n^Dn- sova..enoly , with each EST 
at a separate, defined position. The substrate may be a multicell 
substrate with each cell ^r.C'nta inmg a microarray of ESTs. 

TECHUOLO'SY FOCCJS - BlOLC^Ctf - Preferred Cells: The FF ■::elis are A^remonium, 
Aspergillus, Fusarium, Humiccla, i^ucor , Myce liophthO'ra, Heurosp-jra, 
Penioillium, Thieiavia, T:>1 ypoc La^ili urn or Trichoderma ;::ells. The 3 or more 
FF cei:.s are the same cell or different cell, preferably Fusiriuru 
venenatum, Asp^ergillus niger or Aspergillus Dryzao cells. 

TECHIloEOGV FOCUS - Ci')M?r 'T II :G AtJO C'jNTROL - Preferred Medium: The computer 
r-'adable medium is a fl^-'jj'py disk, a hard disk, randjm access mem'-^/ry (FIAM) , 
r*vad only memory (ROir) Cr-R-JM. 

L 1 2 3 A! ! S W E F' 7 ' . ■ F 7 3 VJ F I :< C3' P3' i-' I C- H T i : 0 1 D E P.W E NT I U FC P.IcAT I OU LTD 

AW ..■)00- 37 "37v^8 [54; WPIX 

DNN :]3000-434t59 F-NC CAKtS- 1 7 1 1 7 5 

TI Clus.:erin.i ^-jene express: :n datapcints m a computer system using a 

SO' 1 f -<..irijar:izing map . 
DC Bi4 Fib TOl 

IN CuLNP, T P; LANF^EF:, E S; MESlPo'V, J; TAMAYO, P 
PA 'WHEFj; VJHITEHEAj INST ET'tMEjICAL FES 
CYC ^7 

PI EP 1337153 A2 20000930 (200054)* EN 3 9p G06F019-00 

P: AL AT BE CH '3Y F)E 33^: ES FI FF. GB GF IE IT LI LT LU LV MC MK NL PT 
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ADT EP 10 371 5o h2 EP J: JO' i-:v')20:y' J0(;0O314; ^A j::.iX.639 A: OA j('0n-2 30(3639 

2000C31';; JP 20ori.3r'^99 A 20iitj-^n:h6 2C:'(mi:-.i5 
PF.AI Ho i9;v:^-:i ^.f; 3 -i i n 

IC \.C\A ':l2O'i01-68; G06F019-00 

'":i:,M'i() i-(.)ri ; (;:l::jO'j i-' ic ; (:i^M'')i5-:j G:):.i](.'3 3-]A) ; 3;i.ii:](i3.:-3 0 

ICA Gj6E011'-3^! 

AB EP ^A371';^ A UPAB : . ■ I'J ) 1 nj;':^ 

ri'jVELTV - A iretho'l r : . ..i.-te riri'? qer.- ^i^cp^■f^-;L:i.■Jn < Pj t. ap:' ^ri^ j .in a computer 

^ jETAILE: T'EGGPrrTI^'::] - [•P.^tri.Dj p.; r : :u ■ -v-r-ir.g < ai :.n^:.;- (each 
'"i--. ap" mt I/; a seri-'.-;: ■:)!: ■j^n^j eMprec .■■ i : n viiifes) l^ a : inp it. e j: systeiu, 
C jM.pl ^ 5C ? : 

a:3iii:j ^ seiL-oi:-,Mn.-^in-4 map {P;M:, : L - :■: t e r : ii 5 ^ '.iatapccnt s so 
that oatapomts tnat exhibit similar pa^'ie^zis are ciusterea together into 
resoective clusters; nrrJ 

(0) providinq an output inJicati.nq the clusters oP th- datapoints. 
IP'DEPEIJP'EIIT CPAIMo are also iricluoed tor t.rie r..ll'jwino: 

(ij a m^-thod i.i .ji.:.ap.:ig d.^tapoin^s m .1 -cmrc-or system, where each 
ddtaps'int is a ^eri-s o: c-'ne express is-r. values, c ■ :i,j;uc s irc : 

{:) re ^-.'ivi nq gene expressic^n valc--s the di ap'-d n: s ; 

fii) Pilterinu out ■i.rs/ .iat i.rc s *: aat exh.d.d:. i s.;u jn l ri cant 

oii.nqe m tn^ gene ' expression value, so that wcrkin:: datapoints remain; 

' (iii; n'j-rmalizing t:.- gene expression value cP *: he w^orhing 
datafj'D int s ; 

( iv) usin.:f a '[^h ar^juping tne v;jr ::ing datacoir.ts so •:nao datapoints 
that exhioit similes p-.:;terns are j]-'„;ped to^-^ner jntv respective 
c lust '-us; a;i.i 

[v] providing dO output inJicatin:; the :irocFS : P the datapo ints; 

(2) a computer ar^paratas for oIjs t er in ;j da tap*'j 1 at s , wn- u e -acn 
datapomt is a series v^ene expressicai values, c :):upr is lag : 

(1) a sc'urce l'P '-^ene expression values ot the uata^'Omts; 

(li) a processor routine coiupled to receive datap'jints from the 
source, the processor routine utilising a 3'jPI for claster^n.j datapoirits so 
that datapoints that exhibit similar patterns are clustered together into 
respective c 1 u s t O' r s ; a r j d 

(lii) an output device, coupled to the pr':)cessor routine, for 
indicating the clusters of datapoints; 

(3) a computer apr.aratus for cjiv^upmg ^atapcdnts, where each 
datapomt is a series of gene expression values, comprising: 

(1) a source C'f gene expression values of the datap-oints; 

(li) a filter M.upled to the source, for receiving the gene 
expression values and filtering out any of the c;at a|>Mnt s that exhibit an 
insignif i':ant change in the :?ene expression value, s:: ^:hat working 
datap'Oints remairi; 

(lii) a normalicir:g process, C'jupled to tPie filter, lor normalizing 
the gene expo'ess ioii vaJu- 'i^f the working dat ai:'0 i nt s ; 

'(iv) a r.rocesssr rooitine that is responsive L:- the nsu mal izmg 
process and atiiizes a Z<AA ^froupm.; the wcuPin; ua r .pr. i nt s such that 

datapoints that exhirut sirrilar patr-rns are gicup--d t-^jether intc 
r e s p e c t i v e q r ■ c u r ■ s ; .:d n d 

(v) an output devic*:.-, CT-upl-rd the Fr^cessor routine, for 
indicating tne clusters :t d.::t ap : int :: ; 

(4) a methoQ :or assessing -rxpression p.atterns of twa or more genes 
in cells, where th- expu' ess i .an patterns are r epres-^nt eol by oatap'oint s , and 
each datapoint is a series •'A :;ene expressicn values, c i-np r i s in.:; : 

(i) receiving the gene expr-rssi'jn vaJues af th^ c:irap:ints; 

lii) using a .AsM, chasterin:; th^- datapcdnts su :h tr.at oatapc:ints that 
exhibit similar patterns are clustered togetner int.: respective clusters; 

■:iii) prcvidir.g an -utput indiaating the alusters of aatapainrs; and 

(iv) analyzin-: the outp-ut to determirie the sirularities or 
differences between the expressian patterris af the aenes; 

.',5) a method af determining relatedness cf expression patterns af two 



jr mere gene, 
ind each 'iat- 
(l) rec 



■■ , where trie cxp-ress lOfi p^aL-LeriiS 
ipomt io a series ot ger.e expres 
ri.vinq the gene expiessi^n value.: 



:ar 



(Li) using a SC-M, clustering :he datapoir.t 
-xhitiit similar piatrerns are clu3t--red togetheT 

(iii) provr.iling an c-utput i nc: i it in-:^ t h- .:: 

(iv) ar.-.ilyz i n ^ ^:ne c^utj.ut t'.. <:ie".ernine 
differences r.'etween tne expiessi-: n p^atterns -A 
determining ^he reiatedness of rne g^'nes; 

(6) a iT> thol f ■: r cha rar-ter i ^ i ng expression 
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expressi 
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the effect 
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sample) . 

ADVAUTAGE - Using SOMs to cluster gene expression patterns into 
groups exhit'iting similar patterns rtakes it easy to analyze gene 
expression '.^.ata from potentially thousands of genes. 

E'ESt Rlr-TIOi] t'F DFAWII]G(o) - The figure is a sthe:natic diagram 
illustratin',7 the f.rincip.le behina the se 1 f -organ i z In g map', in wnich 
initial geometry of nodes in a :-xs rectangular grid is indicated t^y 
lines connecting the noces, ::atapjints are represented by 



genes and to anaiyze the relatedness of 
the unknown funL:tion of a gene unoer known conditions, 
f unknO'Wn cc^nditions on a known gene 
f suL:cessful treatment by a drug 
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nodes are represented by large circles, and trajectories are represented 

h'y arrt ws . 
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MC CPI: &O4-CO:; B04-E:j1; B04-N04 
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D05-H1^; D05-H17; ['05-H18 
EPI: TOl-J 
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L123 AN3WEP M OF /J WF. (.(jfm'oni zo'jx L^rLr...i:..i-^ i ± i ± ^^x^^ 

AN J "iO-^ 7 -'7 :;■ [ ] wpix 

DIjN I ] 2 - I ( . C - 4 J 4 F 4 3 f:>NC ' *2 0 0 ■-17117(1 

TI L'-te<-:t. imm ;..f WTl g^-ne functional niut.at ior^s and EWS-WTl ger.e 

:M/;i'jr:':; L-y m:.>nin'j: .H':; ■ i :'V/ri::tream q-^^nes jTta'jlated by WTl, useful for 
(1 , :r;ir.-'ri:^ '"'f :.eop!a;;ia --iixi :or anti-canrer 'ir.ig screening. 

DC }v:i4 [■!(■ C(':- TOl 
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IFjVEFI'Y - A ir.ethod 


f'-)r ty 


- de^.^ 


cv-ion of 


WTl qon- 


, functional 





:natac-.n:s or an EWF-WTl ■j.:ne fosion hy non l Fc mng ---y^ression -^vei3 cr 
j ■ v;njFr-3^:r: genes, ■'■nio^h ire re^j' 1 1. a t e^:! by v;Lia ryoe WTl and/ox EW3 genes, 
1 ■-; n '.-J . 

jETAIFEF L'ESCr:A:iFF - F!e*:n-:in f :a: ie*:ao*:iny i lYTl gene 
functional laa r -i t i o na ■ > EV:S-WT: :ena taaia-na m rarg^^r: cells 

: :-ripL'i.ves : . , ..^ i _ ^ 

( ]_ ) ,^,3- ing exoresaion ot at least .: acvvaiS t re "nm genes of v'^JT 1 ^r one 
r.r rr.--r^' downs ^ ream genes of an EW3-WTI fusion m a scrapie of target -ells 
and reference cells having a wild type WTl ana/or EWS gene (tne reference 
.-►-lis are othierwise substantially similar to the tai-.iet cells, the 
cownstrea-; genes are up- or down- r egu^ateo by the wi±d type WTl and/or EW3 
aene ) ; and 

(^) ■roinp^aring the expression cA the dc.wnst rearr. genes in the target 
cells and the reference cells (a difference rn the evpression oetween the 
target cells and reference cells sugaests a WTl functional 
mntati^^n -nr EWS-WTl fusion in the target cells). 

IFDEPENDEUT C2Ai:^S are also included for the following: 

(a) an in-cell functional assay for a WTl sequence 
alteration comprising a method as above; 

(r) methods for detecting a n.utation in a target WTl gene or a 
translocation fusir:g EWS and WTl genes using a computer; 

(.-) a method of diagnosing neoplasia of a t.est cell comprising: 

(1) hybrid! ning a t ranscr if t icn in:iicator -:. f a test cell to a set of 
nucleic a-10 probes^{the transcr iF-tion indicator is chosen from mRNA, cF)NA 
an.-J cPrJA, the' prot-'/S comprise a numt.er of nu.:Fe:c acid mc-iecules each of 
which IS a portion :-t a gene which is actl^^ate:i by cr repressed by WTl or 
EWS-WTl fusicT: protein;; 

(U) d^:-tecting amounts of transcription in..iicat:.r which hybridize to 

e-j--h of the set of or^obes; 

nil; identiiyin:^ a test -ell oS necplasti; if Fyh r idizat ic n of the 
transcription mdi^-ator of the test cell to a probe which is a WTl- or 
EWF-WTl-aotivatevi gene is lower than hybridization using a t ran::cr ipt ion 
in-iicator"from a n-^rma 1 cell, . - hytu idi za t ion -^f the transcription 
inoicator of the t.st ^-11 to a prcbe which a WTl- cr EW3-WTl-r-pressed 
gene is higher thcin nybri dizatir. n using a - ranscr - ion indicator from a 
noi'mal cell; 

yij a mvtho.l of ider.ti f yirig anti-zaiizer drugs comprising: 

(i) contacting a test ccmpcouno with a human cell; 

(ii) determining the expression of the cell of at least one WTl or 
EWS-WTl up- or d twn-regulated g-r^ne; and 



(iii) i n t i f y i ri g Ei l^GSt. c 'jitid j^uri li 'i.i -l r')Dt ^intiicil ci-nr i ~ c "-iMC'^-'ir '"irucj if 
it in^rea3€;s expres - icm of at leas^ "^ne WT 1 )i- E:W3-WT1 up- re ;:?u1 at e :J :7eri6^ 
or de^re^ses express:, ion ot a WTl ■■)! :-:WS-WTl i :)v;n- leguia te i q':-rie m tne 
r. .iman .:e 1 1 . 

n."^E - Trio me:.n..id^^ can :je usO'J tj di.i-gnc'.^-;! cancers, for ir a ::5-3 ::roeninq 
ari:i to functionally .a.naLyzo mu t :i t l jn ■• 1 rr/ jIo' :.n :i tne WTl (^one. 
Tne morho'is uso pior.os b.oso-l on qorieo id-n: i i o-:i as b-'inq re-:iu L ;it ed by V^Tl 
sni/oi: Wil t'lsi'^ns ■:> E.v]:: , ]'n- ruo^r.~):3 dri ijo u^e^i to detierin: no tno 
:ronoti : Jtatuo ot: tno ^v;:.:; .in:! t-j i i-.ntiry v;ni::!i nave oiraLlar otfecto on 
d'jwnst re im ■:|onoo . 

FA At:;; 

MC b'4-oC'i; i:0. 4 -ciJ 1 ; d.4-rjj 'F; -^-FOi.; H j - r'-.' 1 -j jF ; Bj^-HoI; 

B04-N04; B11-::0>F1; B:I-:':i'- B12-K04A3; 
B12-K04E; J^': -Ho.-; D05-H09; - :i i 2A; :V' ' - F A ; 

JM5-H14; o-Hj 7A; :FF,-K;7'::; Fa'-HL" 
FFI: FijS-C; FOl-J 
TECH i.-?TX : FOOF lol 7 

TEFHfJfjFOoY FOFFF - F AjTECH: IAL'jGV - Freferrei M-*A:.:ols: The dov/n^tream genes 
ar:e t ranscr ipt i-i-na : i y reouiarod by trie wild typo WIA qeno ari r ne 
exr^roso i-n 'ji t Fo d'-v;notro^m ■ireno^ : o '-i-^ ;^ 'o ^ i by ;a oor in ^ -^nxowrr: s ^^f 

F-jrO'Unts of r r ansc r . ^lA. s .uro neasaro \v:th :i h.i i::- :l-_^no 1 1 y nuci-Lio ^i::id .i^rray. 
T'F.- da^wn.- tr'-arn j'-^r.-o rr'jmp.r : ne a^ ■:^r,- V/FI .o- r --o- A a ^ ed cF-^o,:.en 

fr-Ma natural Filler ■::eIlo protein -j |:-re ::ur s ':o~ [iljFz.AA, ioLa*:e 
bindrno protein [MF^.?17), FALFHA-M ;ono for Iro;.:,- 1 ; iFo 1 i n 
(^oAFhoF), heat oF'i'jo protein iIoF7-jF F^ji7".;j, oFF poO'd.^ot 
(}]17FiF:0/ heat sFo-o: 70 FF protein 1 {TFF.V7:, ^ruijA f :;ou 1 i.n 
(I-: A) ) , pr-jcol 1 a':r;-n-a Ir na 1 ( Fi : t- ; , t'O-a-::!! ::r;ar i n ' p 1 a 3:10 n^:^ :ien 
a'.;t i -^^r .:i>- inr.^hi^o: 1 {M]4t7AF/ cy:iA.e-0'i^ or 'la. i :irA JjI t- C)!' fr-:'T]\ ra^lAated 

KO ra 1 1 n':i':^yt 03 (X'J'iFA:.), Fr"..:i.n n a t a :.:r mo i o protein (!i:lA:'i^o, 

i uFc^iTl ra vii'us leaepf cr 1 F-A_j\R.] ) (...F. j j ; , * '/y-'O 1 .3/t i-a- F-a laA. a i 1 / k*:.-ratiaa 
(F71F70), tubulin aipha-F chain (fF;':.F.U}, purine nucJ-oaside phosph.aryiase 
(T-17964), Gem iaTPa3e (UlOSaO), adrenomedull m ( LA 4 8 7 4 ; , GPI-anchored 
ur'ji-:inase pdasmincxrien activator surfaiae recerjt'ar ( PF-;'r 0 ) , tissue typ>e 
piasiTLiriOqen activator (X13tF7), qrav:n {riFF__), jur.-F (Aj1I:45;, 
elongation factor 1 alpiha-F (A794 ^-^0), h'Mr.eotic gene rogulat'ar (PA 6977), 
'alorio 91 IF (>i5734b), interferon 'ganjaa t r eat n^.-rA. inai.icikde mPA)A (I^2F683), 
transmembrane I'ecej.'t i^r protein (Z17^'F7), aci-aii: FGF [>F:5779), 
'rartilage linking protein 1 (:-:i74'F:;, inteileuFin (IL-ll) 
(F':AA-;77) and mi t ochonarial phosphate carrier protein AF49F?A). 
The dc.'wnstream genes cC'mp'rise at least me WTl dC'Wn -r egul atei; g-ne chosen 
from caveolin and elongati'in facti^r 1 al|:ona-F. The 'icivnst ream '.jenes 
further corr.prise p^ 1 or EGFR ( endi't "nel i al gr^awth fa^atC'r). 
The rrLOthod further C'ln.p'rises mcicatinij the Aass ot function 
mutati'in m the WTl ■■^ene m the tai'^^at cells if the exp-ressii'n -af WTl 
up'- /dC'Wn-reguFat ed 'lenes is at least tv;:- tirr.es Less/Ajare in the target 
ceFls th.:iji in the i^rfeienc^ i::elA3. 

T}':e dC'wnstream genes damp-rise at least irie EVJF-WTl up -regulated gene 
'1 ho sen fr-am Pseu-'icuaahAes vii-us nucA^:'ar ^liti'^v-n (KFO':''Fio arid T 7 9475), t'Ota 
ci:ain of IL-F recept'ir (MFtAj^F), Gl /F-sr^eioi : i c rycILn Dl (H2'A:.a9), 
neuron-sp'Oci f ic garim.j-F en-alase (M^F':'49;, f uiu ian -^ecio NPM-FIFFl tesulting 
rrom transl'a'-:ation t ( 3 ; 5 ) ( gF 5 . 1 ; p a4 ) !F']aAa4), collagen aipha^dV) chain 
( H'-F.] Fa;, ) , T- iymp'hi'Cyt: e sp'O-rrfFc protein tyr'as:ne kiriase 
p : ■ 6 : ■ k • 1 ■ k ) M F : 3 Jo , i n s u 1 i n - 1 i k e r o v; ^ f a t- -a r r A r 1 d i n g pro tein 
4 (iJFO'^SF), carhepsin B fF16510), cystaA:n C 'Tali?-;), dual sp'ocific 
mit ogen-act ivated protein kinase F (P4f2-A), neurc n.al-g Li ai aell 
auhesi-an mol^:cule Fr^FllF), PIM-1 p rot o-on--'aoene serine /threonir.e 
protein kinase (HFo^FF), cark>oxyl^sr. erase r ^'^u rs-a t: (rAAFA'J*), 
protein-oyr-asme ph-asphatase PAF-1 'Fli2i9) .anci inter ler'an-indU'aed 
protein 6-1*5 (>Fj2 4''-^F) . 

FecpAasia in Kicney aells r Ewing sarcima ae^ls cari b'O deteateo using 
probes of genes, which are activated or rep-ressed by WTl or EWF-WTl, 
resp)ect i vel y . The p^rc'bes comp'rise p^artians of the genes and are pireferably 
attached to a sclid supp'Crt and arranged m an array. F'ref eratdy, at least 



d-L I f ei (riii L cjeMes are used aS probes. Tne VJTl geiies rii a uest cell ore 
;:;equericed tc (ietermine ^he ViTl (ler.oryprc status of the cell. 

L12 3 AI^SWEP. ? OF 7 3 W?IX (^CPYF:IGH7 ICiC::! Dr.RWEHT : NFOPllAT ION LT[* 
AN ::00<)-^-7 jlOl WPIo 

[)i-iN U2 [ ) I ' - 'K: 3 1 9 ijno o;. o m i. - i 7 ci s r 

TI Mont; .1 f yinq peptide vnth s-dectel functxon, us^:-ful 

parr roi:lar : y for Oamniatud horri-iLos, by scre*:-rii:iM a<r. abase toi 
O'tribi na'f i''.'r: of nu'/iero ,.i co.'l ar:d -irriinc' acid .^'ecjuejni.'e;:. . 

DC Bo 4 L db TOl 
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PI V/o' ..()ij(i!ibi.:*:..:-^^ Al ^OijOcs-l (, i.iC-0:-^-i * F^P - '^p 1 2*ji 'ij 1 - 

V.'' * -I'T l^'h '"'H O"-' ipE [■'-' Kj\ Eo r l FT' C^F" OH ('-M OP' ip p; pp P3 PP MO pp 

ISA rT oF' SE T7 pg ::w 
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(;:;:) ge gh gm hr bfi zb il tn :e op f:e kg f:p ?:r f:e lc lk lf. ls lt lu 

BY MB MG Mr: MU MW FB-: 110 NY ?L PY R'S FFJ YD YE YG YF BY SB TJ TM TR 
TT BA YG UY YE VP YY EA YW 
/iP Y'Y'Oy^YyL'9 a yOOYOyj4 (YBOO^iO; '":i.::Q'.)i )l-5s 
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IC I CM G12O001-0& 
: :Y G06F017-30 

AB VJ("i Y:)(;i)oo(-: A UPAB: YOOroojj. 

NijVELTY - FdentifyirL'i a peptide '1; Fiavir^.j --a -^r t r era Far 
function ojmprisinq r >r--:pa r i riq a ■i^':abase >f r:-:. 1 you::: o ^'.)t : d--^ (Fip 
i:id polypeptides ;PP; sr uro:::ov;ri functions, s::reen:ri p 
tFie daYaoase fc-r a C'Smbiriat. ir-n (A; 'Sf rsac le jY o:ies or amrns acids 
indicatrve of (I) v;itri a i^^ii^Y icui-.-ir function, an i _ dent r f yisig BN 
a n ^d F' F' wFi i c h c o n t a i ri ( A j , r s n e vj . 

DF:TAILED DESGRIPTIOi: - An FUDEPEnF'ENT CFsYIM is also included for a 
similar method for ivienti fym^^ peptide precursors (IF) having an 
amidated G-torminus. 

BSE - The method is used to I'ientify precursor peptides 
with an amidated G-termmus, especially polypeptide hormories, 
for studying physiologically active substances. 

ADVANTAGE - The method is not limited t^a (I) expressed by a 
particular strain, does n^jt depend on p>hysicaF measurements (isoelectric 
point or solubility), can be applied to proteins which are not 
apparently related by physical properties, and can identify candidate (I) 
with very low mutual honalogy. Ft can l:>e performed with readily available 
databases and computers . 
Bwg . 0 / 0 

FS opr EPI 

FA AB; BGN 

MC YPF: B04-E01; B04-N04; 31F-G;J8; B12-K04E; 
B12-K04F; D05-H09 

EPF : T0F-Y053 
TECH UPTY: BOOYFOY'^ 

TEGHNOBOGY Ft)GUS - BFOB0)GY - Pr-r-ferred Process: To identify (11), the 
database contains botn PN and c-i r resp .>nding P? sequen:es, espe<::ially with 
BP o^mg derived by translation fr^m BN, and *.-ach BN is translated into 
^:hree PP {one f c r each reading rrame). The database aFso includes 
annotations related tD the sequence, -^specially origin, s-'jurce, 
:haracter ist ics and references. The s^rreening step addit i':)nally includes 
Foration of the AYG start ::3don and verification tnat no st:^p codon is 
present between tnis AYG and the end of tne c<jding re pion f-:r (II). Once 
PP/PN nave been identified, they are compared with sequences of known 
function and thC'se witn unknDwn function are retained. 
These are then examined l:'::r similarity with s-quences of kn-i^wn 
function, and those for wFiicn n.:. simdlarity c.in oe found are 
selected f^r further analysis, :'r as candidate sequenr^es. Optionally the 
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interpreted to yield conrrrmed nutations. 

[tETAILEB DE3CBIPTI3)N - lUDE PENLfElCT CBAIMS are also included for the 
f:llov7irrj: 

(1) ra-econdit ionina the P)B of a Bdoonip, '^ompriaing: 

(a) tessellatinq the L-utput pattern to matoh a f>rede termined stimulus 
pattern yielding a 7esseilated output piattern; 

(b) extracting local paranetrics from the tessellated output pattern; 
{c} determrning if a degree o: ampditude wandering representative of 

^.he local parametrics is v/ithm an a 1 Ic^waljle genernt-jr function 
1 im: t ; and 

(d) lL n:jt, ren7'rma li zing tne tesselBate^i 'jutput. pattern t'j mat.:h 
sfieotral r^rori^rt ies of stim.ilus f.-attern and refje-r^t m^:] (b)-(d;; 

(3) f 'e r f ."jririinLi a ^'.tjrive rgent reverP'erant dynamics resonance analysis 
'■.m the f3P - 'f a bi7-diip o:- i-ientify natations, atmprising: 

(a) determmi ri J rescriance -lynani'^s relaxation values based '^n tne 
preC'Dndit i^jned outp-at pattern .und resor.an^-e sti:Tialas; 

(b) filtering the ■.iyn.ami^a relaxati'jn valu-i^s u.:;ing -ensemble boundary 
an'i CCR filters t^- y-eL'i a sec-'aii set of valaes; 

[■:) applying bu..K property estimators to the dynamics relaxation 
v^ilies to yield a tnirl sr^ of values; 

(■-ji evaluating tPie se^'ond and tniri sets of values to ieternine if 
r*.^S':)n an-::e jori*/e rgen :e nas ijeen d^hiev^ed; and 

■: e ) latermining if dynami'-^, para.t/sis has occurred, and repeating the 
meth-jd if it has; 

system for analysing t)? of --i biochip to identify mutati'-)ns 
C'Dmpr IS mg : 

.;a; a tessellation unit operati'/e to tessellate the tutput; 

[b } a stimulus pattern generati •)n unit; 

iz) a resonanse pattern generation unit and 

(1) a resonance fiattern interpretation unit; and 



/ 4 '-"^ "i 'J 1 



(-1) system tor performing motrijd (2), ocmpixsmy: 
(a) a determinai- i oil unit; 

a fiitei- t-:- i liter -iynamicc 2:e.: ixat : <-'n valued' using ensemble 
b-:-undary and CSP filt'.rs; 

( ■ • ; an *'.;sl i mat :.-r ur.it ; 

('■i: an vn I uat; i -^r. anl^.; and 

fe; a p.ira lysis ■::e t -:-:ti :)n unit. 

[J::-E - The n.eth'j:! is used to identity mut..it ic-ns , { jarticular iy for 
diagnosis of disease (e. .:^noer and a>::uired immune defioiency syndrome; 
arid to ident.ify sus sept ib Lii ty to iiseas-:-. 

ADVAtrr/ViE - T( ssel 1 -J tiL-n ol' 0? improves she repec;Sabi lit y of the 
a a i 1 y !■ - 3 , 
tog. / 

Fs :ti EPi 

FA As; oC:i 

MC rri: B^J4-F(;'1; B04-N04; iril-o03; B12-K04A1; 
D05-H09; :-'Ob-iil.'; X 
EFI: TOl-J 

TECH IJVT/.: :'0(:tl016 

TECHNOLOGY FOCUS - IHS TF-CMEIiTATIOri AND TESTItlC - Preferred ?rocQSS: OP is 
■:l-teLmined frcm a rrisroarray ccrnf-r i smg many :ells ea^.^n with a set o: 
i ienti :al iimnob^ li "o^:! -m i gonu'::! ^svtides , iifferent f:^jm those i.n a:.! cOiher 
;:"il;:; rastisuiarly a ''j<: \ spe^ dot: ■ .y i am . Tessellation S'jmp'rises aiapii i ying 
highly lO'Oal m^irph ^l':.q i ri 1 variatiions in , and generatii-n oi ol' is based 
on Oulntuiri ExL^re-s^.-r Functions OJE^ ) • OP is se.s se 11 .i to- i to matcli 
t ne^Tlorpholo/ioa.. ■ li.rt aot en st los c.f OEP. C'PMonally i3cal S'arametrics 
:LP) -jire isolatea rr:i:. ^ne tessellated 'iP :Ui i i deg::ee cf amplitude 
w jnderrno, r-rp'reser.t at : or ITO is definea. If tnis degr-i-e is n^^t v^itliln 
^ or^-idet ermine :! al lowalO^- .j^neiaror function limit, t ne 

t'vssellate'i i=P -S len.Mmjlized t-o nat^h furtner speots-il p ri.pero i es ^of the 
i-^ssroinc-e patterr., l.i ar- r ';pj:e.-;:.-nt: at 1 ve jt inte^r^ited d-:ssi^'y ot 
srares within ea :h r^-gi'in o:^the tess-;;l 1 a tea OP, ani the oe^iee of 
amplitude v;anderin'.i is determined L-y appiyin'3 ~:i Palm distt^ourion to 
develop' Generators fc^r estimati-on cf stcohastic wan^Jeiing. I'pOiDnally the 
tessellated OP ar^d 3P are transformed t^' a metrically transitive random 
flel:!, optionally tod low^d by renormalizat Ion .if the -essellatei OP. 
Generati'sn of CP irs7olves selection ot a subset of :r.urations and selection 
of a sutiset of nonlinear QEF, then transforming the OEF and tessellated OP 
int'S phase space. PP is generated by iterative processing of OP t^y 
performing a convergent reverberation tc prc'ouce a resonance patt.ern that 
represents resonance between a selected QEF and the tessellated 0? until a 
p reoetermined de'^ree of convergence (DC) is achieved between resonance in 
rP and that exoected for th- set 'if mutations. Optionally the resonance 
dynamics analysis method Includes filtering the diagnostic conditions 
identified by the cirr/eraent revert^erant dynamics analysis cased on 
clustering properties, and a diagnostic decision is made based on these 
conditions. If tne debris ion is rie native, tne av ailat'il ity oT. an 
alternative is determined, ana if none is available then a new subset of 
mutations is chosen. Tne biological Son.ple is I/MA, sI^A, proteins 
, peptide-nucleic acid, '.-r samples from tar^jeted nucleic a^rii 
an:pC 1 f icat IcTi . 
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II [7n,::mic storage, letrieval ana analysis of ^: *.r ime nt al d^M e.g cell 

biologi'ial studies an :i drug aisr-overy usin:j a graphi^ral user interface tc- 
represent biological entities and relationships. 
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bl (jH [jU h\L bM tjM Mi- rn ' iL.' 1 1j lii lo r r-.L r-o r.r r.n r- ijL. i^a i^o 

LT L'; LV MA [-IJ [•](.; MF Mi: MV; MX Nj NZ PL PT PO PH 3[) PP 5G PI 3K SL 

TJ TP: TP TT TP MA [K- UP UP VH Y': PA PW 

AU P0000300P7 A P0iP:OMp.l (P0010<) GP( F(i ;7-pr) V- - 

ADT Wl) PiiQij049PmO A1 VJi .) P00i)-UP43Pl PUfiiP:)2]8; AU P O'JOO ^ OCi; ;7 A AU POOri-PCiC P7 

P r:(Vl(')Pi8 

FDT A'.i .pV")OOPi i'AP7 A B.i::-*e^i 'jri VJ:> iP 0(i4';p:-;i) 

pRAi 'jp 1 ^^^'^-iPU"Oi :p:^'^'''mp : 

IC ICM G06F017-50 

I'AP G06F017-30; G06F019-00 

ici i:p:)PiF1P':^: co 

AB VJ'j P ''iPi"!^ p- .4 A npAP: P"/ ■ 

IPjVFLTV - A methV'-i r-jr ryyu^n^-. V 'j-ra^e , re*: ri-v.;ii c^rici di.sp....y of 

Tht- i'ie*.hu':i ^i:— a ■pi^phPoal u;-^-r _n^err:a'-:e ' 4 : : rom \;h..:_.:h ohapcr? 
■ ^0 A-'*'! / P^^'/ / ^ U'J, !0..:) -in-i 5:rrC'v;;:- ' 'Pip 104 , l')P, iPA~; , IIP, IIP) la ■}) resent: J.nq 
D i oi ij'fi ca 1 entiti'-.-:; ■irid ■. i aiis : o;r iria i.cin^: re.ipec*: i::-.:!]/ , ■::ari L^e ^nf^ut arid 
edlt^'.l. Mult i-dj.m'-rioiC'nal informal ^.o:. based on a pr<..-deterrain(^ii, but 
e/ip-nridable, hier ^rohy 1::. ir.put zc relate ^lae in t o r ni. 1 1 . ';)ri ad^ii".iorla I 
Lntormation, e.q. tr.jrn ?:Pe Interri^jt. 

METAILSD PEPCRIPTIP):: - lUDEPEliriFNT CLAIMS are ai.:o in-i-Puded far the 
f 'jJ I<- wrnq : 

■ !) a rnetriod f'jr di ,r r) 1 ay irici exper : merit. ^1 ^aP-.a j'la*. :.en v;it!i rjt-^t e rnu ned 
r-Iat L'l-nshir.is fr-iri "1 r^:n P:e C' >vq.^K*j-ii ; an-J 

il] :i;ys~eri P^r ayr.aaiica± J.y ot'"'rin:i r-- r r i e'/i n^j .n:i i.i : ::p ^ay m :^ of 
ex^'er imenta I in f -.ari 1 1 a with derernined relot^'jnahip.:; , 

UPE - The neth.jl sya-.^m la useful Per 3--:;r:ri-/, rerr^evPnc ani 

arialyzmq e:-:pe rir.err. al 1 ri ^-n ^ ^ i 'yn r^:a: use ^n 'jeLi h^:'ioqi::ai studies and 
■iru-7 'ill so "jvery . 

A:iVAi:TAaE - The sy:fto"i a L : ov.;s easy s--)r^.je, re^rve-'a. ,aai ^analysis of 
oA-iljqioal "Ln f ':aa:ia t , . on ^ ;.!::ia*. e v;ith bacAO' :i :i ^ pathways. I ■: can acoess 
hierarohi'.:a I m f jr:u:i tiv'r^ assooia'ied v;ith I:- :':-l-.)y i, :ai pathways from 
multiple fjrivate and pur.dic dataiiase^'. 

DESCFIPTIOn ()F DP'AWIIP^IS) - The figure is a bl-jcv. di^^^qram showmq a 
screen display of a graphical user interface used to create, store and 
analyL:e taological pathways with determinevl relat lor.sh ips . 

grap'hical button fcr selecting a shape 60 

grapliical button for selecting an anew 68 

■■.|raphi:al butt'-.-n for selecting a cell ^irganelle -jr coripartment 70 

outline of a cell 7P 

3n ■i.utline of a cell merabrane 7P, 

other graphical buttons 7-, 80, 82, 84, 80, 88 
Dwg .3/1 1 
FS CPl E?l 
FA AB; GI 

MC CPI: E.04-F01; B11-C03; B12-K04; D05-H09 

EPI : TOI- J0 5B2B; TO I -JO PB4F; TOI-J0 5B4 P; TOl- J06A1 ; T:Jl-JIP:-: 
TECH npTX: POO:")! 016 

TECHIJ^OLOGV FOCLP? - H^STPUME/JTATION AUf: TESTIMC - Preferrei Method: 
Information associated with tne series of shapes inte rconne':: ted with 
arrows as a biolc^gical p^athway with determined rel a 1 1 jnsh Ips , is saved in 
a databas-. The patnway d'rflnes a hiei arch L r.-i 1 rep/r-^senta t ion cf a 
bicdogical function with determined re lat i jri.^ nips between cf-e 
entities and transformations. A c-jmputer re.-jdab^LO m'3drum st-:>res 
Inst ructiC'ns caasinj a c-entral pr^'Cessing ur.it ti' execute tne riiethod. The 
bijlogi-al entity repres-a~:ted by a sn-^pe may oe a cell, a sub-component of 
a cell or an acjgregation i-f a number if cells. Tn-- entioies may insiude 
act:ive or inactive entities, entity innibitiuc?, f:r:tors exchanged between 
entities ir inoerme lia te entity t r ans f orm.at ion prC'dacts. The arrows 
represent biological t rans format ii ns between entities. The transformations 
may inclu'-je transcription factor aitivation, oelluisr hypertrophy, 
protein kinase activation, protease activation, ^ene expression, 
rereptor activatim, ap:.f:to3is, ir.te rr.al i lat ion ci :ell surface receptor 
proteins, mi tocnondr ia I p.itential, neurite ■jutgrowtn, cell 
viability or miotic index for a sub-c-:mp onent of a cell, a cell or cell 
aggregat iO'n . Additional information inputted may loe a com.ponent view, a 
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morph'jlr.ay of the entity, an eiejtron microscope pnocograph, a liuoi^s 
v^.^-; .--f t'ne entity, basic inf orii ^t i :ti , site i nf jrmat ion , function 
inforrT'at.i Dr., enzyme information, reaction miormation, transport system 
information oc a pathway view. 
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P7J: 'at pp pp :y PR pf: pa ep pi pp gb gh 

^)A PT SL' 3E PL pp TP UP PW 

W ^E AN' A^' AP BA BB BG Bf- BY CA CH Ci: CR CPl CP DE P?: [^M EE ES 

Fl CB GP GE GH GM Hii TP IP IN PS CP EE FC KP FR KP LC LF RR LS 

LT LN LV MA MR M( i MP* MN MW I^ii IK) NZ PL tT F P' RC SI"' SE i'G SI SK SL 

t; tm tr tt TP ji\ up^ US iap vn yu pa PVJ 
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;(y.0:rAi4>:, AP VJP' P iRj P - PS ''^ 4 4 P POiAPJP'Jp; AN 2 P i A) (; i 3 P i Au PP00-PP919 

p."P)pr;"-; 0 :■; 

:o- [p,:'' 'i V^il- A B.ise>.i r-n W- : p:'0"o>Pi46 

ijS RV:^9-iP4 6i;:; IPP'nilC; tJS laa9-12P:*4j 1P9P0P05; PS 1999-142299 
R^9>;:i7 0P; PS Ia99-1G4P17 Ir^PillP 
i.:K C:2N009-PP _ . , ^ 

■■ -Rj A'A ii-Ajl -P'J ; A-'lHO'':- Y''; i'lPN0P9-14 ; ClPNOl P-R:'; : 1 PNO A:) - *:.2 ; 

'":i2I]PlP-oP ; G06F017-50 
Vp-- 2PPP5214P A PPAP: 2P':C'lPL4 

N^VFPTY - Mernods of unencr ypt i n-j trait encrypted aene s.-quen-es to 
or -■vide unencrypteo PNA ar^.i proteins, are new. 

DETAILED DESCRIPTIP^N - I [ ■REPEtlDENT CLAIMS are insluded fe-r the 
f'l'l lowing: 

fl) a methcd (I) of ur:encr yot i ng trait encrypted gene sequences to 
prrvid^'at I^ast 1 unencrypted F NA or polypeptide, comprising: 

(D) providing a number of split gene sequences (Grpl) eacn of which 
■-r.^.p^rises a subsequence ci a geneti element; 

(b) transcribm^^ thie split gene sequences to provide a number of RNA 

segment s ; and . , 

' (c) -rans-splicing at le:iSt 2 of the RtJA segments together to provide 
at least 1 unencrypted FiNA; or alternately 

(c) translating the RNA se':;ments to provide a num.o.er of 
polypeptide segments and t rans-splicin; at least 2 of the 
polypeptide segmen^.s together to provide at least 1 unencrypted 
polypeptide; 

(2) parental organisms (ID and progeny organisrr.s (III) F rcouced via 

f. I ) ; 

(■•:) a vectci F-^'^^----'^ ^-"^ ' 

:4) a if.ernod :III) '..^ unencrypting engineere.i genetic el^-ments to 
pcovi'i- at I*-o^st 1 unenci ypt cl functional polypeptxde, 
c-::r;F rising : 

(a) pr-:::ducin'5 an en "ii r":eered genetic eiement coriespondin',; t-.- an 
enccde i functional polypeptide; 



(b. 



"procucmg a secor.3 engme^rred genetic elemer.t corresponding to a 



... „ _/'nd enco':ied polypeptide whion is nc n-f unctional in 
rh- absence of mc.a. f i ca t i 'iii by th- first polypeptide; 
mixino^ the engirieered genetic elements; 
f..i; expressing the :fer:e-i:; elements t^^ pr jdu^re their enc.ded 
polypeptides; and 

■'e: modifyinq tne Sv'-.::-n-i polypeptide v/ith the first to 
pr-o;uce a function second polypeptide and the: el ere at 
least I unencrypted polypeptide; 

'5;. the first and seomi genetic elemer.ts fu-oduced via ■.III); 
(pj the first and se -:,.nd polypeptides produced via (III); 



via (III; ; 

[V, cc»nip^r i 3 inq 1 cr iricr^:: 1 ibr ciri'S.:: or 2 
.? , t h. e I i :: ' r a 1 1 ^ .? > 1 1 e t i v e 1 y c ^ jii .p r i s i. n q a n umb e r 
sfiilt ^en- s^-'::uerice meitiber type;: in vjr.ici. c-'Dnih'inat i ori-s l i: 
=-\;l::'Cijmo Lti ii: 1 ':'r:s the L . t oene ;:eqi;er.oe rr^eirJ:.*:' r types C';' 1 i ect i ve 1 y 
respor: -i a" least 1 L^mplete oenetic eleiT,erit; 

(iC) a -^^jiTipH:.;;: i r i on ('/; ccimp-r is Lri':; 1 or inC're i i^jrai'ie::- ot 
r'.'^p'ul at i .iiri.^ ... f ^iiu^^ricer- \ ivi^^ed Sfi-iit aer.e 3e<:faeri':e£: , the i ih^ra rie.:; 
::':-lle :;t i vo' I oc^rr^p'r i 5 ing a nujv.ber oi er.h.^riO^- r - 1 i ri ke*:! i'--riR seouenoe 

iii-mber tyt.'es, ea^-ti iequh^te<:i by a aLfteieri^ trans-acting : r anaor ip^t i on 

.^y: 1 i t gen-- ,-■ -guen :-e n.em;jer tyres hi.e 3^. i vt:- 1 y ci'ir-^.i^iTA ^. : at ie^st I 
r.:.rT.p'.l et e jenet h:- -lert^ent; 

(11 J a ir^etnL'l (t'l ) f ij.! r e t'tir^iivi nir.g n'jn-;:iv^ : I ap'jih r.a g^vn-- s-^" q':,-:-nee.3 , 
rriP'! i L-.. i n j : 

(a) pr^--:i':oing a nun.ier t£ nt n-C'Ve r a.|:'P' : ng- gene s-i truer: r-s o^'V.r. 
e-jr r espon^:! Lrig to a i f f erer.t: sah'Sequenoe of .'i gene*:io eieiuent; 

(;-i) r: r'-":luo- i n 7 a nunl-ei' c.f crap- nuolei': .itiio eegue-r. ::es e.i:t:h -i' 1: which 
t-veriapis j. mere t-f the ncin-over Lap'pliig gene .-equences; and 

(-::) rect^mbining the non-cw e i: i app i ng gene cequen'::-';? witlt the gap 
n . i'lei .: a- h -is to pa C'Vide rec-Mix- i nec ncn-e'-;er lapp ing g-i ne ceeruencec; 

(1^:] a ct^mp^aoit ion iVll) rt-inp-r i c.i nc 1 or :x.oro iiitrarioc of gap> 
ncrleic acids, :'o:[-p r i s incj j nuinh-er ol g.--P' naedeic at: it: meiiih'er types, each 
'i^f vvni'.'n corep'iis^s a s ai: e ^r-juen-:e icenti'S^^ ol i ooiupl ~iii^r.\.-^ l / ^^d^i": dt 
least j spilt gene sequence lueinter types; and 

(!:■) an integrated systeir. (VIII) r^i.^np rd s i ng a cr-]ricyit'-r or ceTip-urer 
readacde r^e-.Tium, '^L-np-r isin' i a 'i.jta .-e:. ' -^'i r p"--nding K-j a .r'i t i haiacter 

strings ::::vr resp-: nd i ng te- a set ieiect^.i fr'jit. ^pd it g-r.e 3equen'::e:: , 
er.hanv'^r - i i I: ke-c ;:pd it ger:e sequence.:, t r atiS -a ct i ng t r~ -nsctd pt itCi factor 
stque-n'oa ? , engin^-':-r ec gerie-'ic eieinents, ne^n -ece r L ap'pd r. :i ':orio se-queiecies and 
gsip' nucle i c aci tis . 

dJE - one meth'j'US are used lor u:ie:ie:i yp-t irig tr.ai: en^.u' ype:.-! gecio 
sequen'":es t-::- pir'Ovi-:ie uneri':ryp't et; F'.MA an-.l proteins, 
['■wg . 0 / o 
FS CFd EFd GMsI 
FA AB; Jch' 

MC dPI: B04-C01; &;j4-E01; &04-F0100E; B04-N0400E; B 1 1 E ; 

B 1 1 - C'V "S* Bl 2 - KO 4E ; B 1 4 - S 0 ; C 0 4 - C 0 1 ; C OA-E n 1 ; C 0 4 - E^i] 1 0 C E ; 
C04-N0400E; Cll-OQHE; Cll-CO'?; C12-K04E; Cl-3-oO:.; 

DO^L'-Cll; DOS-HOz; DO^-HOS; D05-H09; DO^-HII; I-Oi-Hli; 

D05-HI7; Pt^-H18 
FBI: TO 1- Jit 
TECH nBT>:: 2 0001010 

TECKNO)LO)GY Ft'CUS - B I O'TECHKOLOidV - E'referreti I^etnods: In (T) the FlMA 

segments comp'rise t rans-sp'i icing intre^ns anu the polypeptide 

segments coirLp-rise t rans -spl ici ng inteins. (I; further compjrises selecting 

at least 1 unencryp'ted FTiA or polypeptide fttr a cesired trait t^r 

pu~C'p)erty . At least 1 'tf the unencryp'ted polypeptides is a full 

length protein. (I) further comprises translating an unencryp>ted 

BNA to pirovide -at least 1 ^idditional unencryp-t eti polypeptide (at 

1 e- a s t 1 o r w h i c hi is a full 1 e ri g t h pr o te i n ) , v/1: 1 n i s / ^ r e t h e n 

selected ft-r a 'iesire^i trait or- pu:r>erty. At least 1 step 'jccur-s in vivo 

^■r in vitra. At least 1 of the split gene se:;uen-es is cFO:A. 

1 he first set of sp'lit gene sequences is pv-:-iiood t^y mating a first parent 

organism fv/ith a s-cand grjup> of spdir :rene eeiiuences fdrp-i)} wi.:n ei 

se?.ond parent org.inism fv/ith a tnird :irjup> oi split ■:iene sequer.ces (Grp3)) 

to produce a progersy t-rganism comprisin:j at h^'ust 1 split gene sequen::e 

from the settnd anl third group'S (^^rp^ an:i ■-h-p3) tc- provi'ie tne first 

group of spdit gerie sequen^-res (Grpd). 

rhe t r.anscririn q step* comprises t r ans ::r io i n-:r at least 1 jf trie ^kgoi 
sequen-.-:es anl at least 1 jf the drpjCi sequen^^es to pT'jvide the BMA 
segmen":s. The metnjd furtner c-cmprises selec^in:^ at least 1 p)ro5eny 
organism for a desired trait t^r property tnereftre seiectmg at ieast 1 
unencrypted F;NA whi::h is then translated to prtvide a second unensrypted 
polypeptide (at least 1 of which is a full length protein 



) which IS selected tor a ciersired Lidit or pi.jpeiLy. 

(i:) are either animaiS; plants c^acteria o::: fanqi ;especialJv a yeast), 
and are preferah-ly plants sele-.ited frrmi I ne g-nera Fr.iqarla, Lotus, 
Medi caqo, Onobrychls , Trif o Liuri, Triq<jne 1 la, V.igna, C^t rus, :.inun. 
Geranium, Man! hot , ['aucus , Arabiue^.'S i;; , I.rass ica, F.aphanus , : ;ina|jis , 
Atropa, Ca{:^sicuiL, r^atura, Hyoscy ai'ius , Ly • jper i C(-.n , i cotiar.a, S':'lanum, 
Petunia, Liiaitalis, Majoi-ana, '.t. rnorium, Heli-:.::^ hus, hactucc;, Br.^mus, 
Asparagus, Antirrhmun, lieterocal lis, rF-!:esia, Felar ::'.^ni un, ;-'ani :um, 
?e:.nisetum, P.anunculus, l-eneci-"?, Jalr^ : gi-'^s si s , pucrm i .■• , Brov. lalia, Lolium, 
Maius, Apium, Gossypiuin, Vi:.a, LithVius, Lupines, P ;:chyrhi .:s , Wist-ria 
ansl/or St i z-:-! obium. En p^jrticular, tn-^y ::r^r cro[> plants seler..ed from the 
genera Aur^.^.tis, Phleuin, :a:ty.^?, G:u-gn;m, ::--.;..:da, ..u., C':/.:3, Trioicum, 
Se::ale, Av-i^na, Hs>rdeurn, Fac-hisuni, P;.-, Fe.rt^.:^, St en -t ^ph r u:;:i, Cynsd'-n, 
Coix, tdyr^;.e, Phareae, lUycme, dic-rs, Ln^rS^edus, Lens ansr or Ara sr. i s . 
{ID iri-.y b-r ojrn, ri::e, -^tton, s-'^yh-o^tn , s.r::nurL, v;he , -'i^., h^ri-y, 
millet, sur:tl-jvver, rapesee-i, cmo.a, r-eiis, beans, lens. Is, f.-ranuts, y^im 
Deans, cowpeas, velvet peas, o--:ver, .^falra, li.-pine, vetch, :stus, sweet 
c 1 o ve r , wi 3 1 e r .1 a a nd / c r s >;e e t e . 

At least 1 parental organism -z ::V.i: i number s-f ennari^-e r- 1 ^n ked -r'lit 

gene s-quences, each comfjrising a sui'sequerice ■: t the genetic •■element 
linked tc^ an enhancer se-.:]uence. The first Larental or-janism I'-jmpirises at 
least 1 t rar:s-acting t rar^S'tr ipr; i lci f -ict'j.r .i.-equenje w[ii.ch is ^onlinLe-i ts* 
ih^' first group sf enhari r- 1 i nk-d splii: g-i-rio sequences ind : he sessrid 
c-r-ijanism coniprises a se'S'-ci'l gr^tup S'f -nhan :e 1 ^c\ee-:.l srlit u-ne sequ-;nces 
eav:n ca'ipi rs i ng a 3UOse:|uence tr.e "ler.eti:: 'Sierient iirie^vi .> :i .:)ec'.)ri-i 
ennancer sequence. The se^-ond tjarer.tal organism alss -sm;;. rises a sec:)nd 
tr:ins-acting trans sript i'Sn fastsu- sequence wnish is unlirihed -.o t;ne second 
ar'::)Up 'Sf enhancer-lmked "-^•r)l i t o-n*'' i-'*^ q u^rn :.es . 
The Dr'r.'geny cu-ganism comru iS'i'S -j t Least 1 ■:>: t:ne first gio.'U|" 
enhancer-Linked split yene sequen>-s, :it l-^-ist 1 t j: an- -act . n ;:i 
1 2-anscript I'On facto]^ :;^eqi..-:uice , :.t .east : -'I the seson:! ::r:v:r:! or 
enhancer-linked split geri- sequ-uic-s, at least 1 of the S';.c^n:: 
trans-actmg t ranscript i^on fact-:u; se'^uen^Tes . At least 1 of rn-^ first and 
second enhancer-linked split gene sequenvres are trans :ribed tt- produce the 
FaJA segments (at least 1 of the first gr.j.up of enhancer-lmked split gene 
sequences is regulated by the second trans-actmg t ranscr ipt i:tn factor and 
at least 1 of the second group ::.r ennancer- linked split qen- sequen<.:es is 
regulated by the first trans-acting transcription factor) . 

The meth'-)d further comprises selecting the progeny or^janism f-jr a desired 
trait to select an unencrypted PDA wnich is then translated to produce 
unencrypted polypeptides (at least 1 of which is a full lengtn 
protein) which may then be selected for a desired tra^t. 
At least i of the enhancer-lmked split gene sequences and the 
trans-actmg transcription factors are cDMA. The split gene sequences are 
preferaDly subsequences of toxic genes wnich are spli-.:ed to^:ether and 
expressed m the progeny to produce a to;-:ic polypeptide whi,:h 
renders the progeny incapable of repr^jducing if it is male but <:apable of 
reproduction i: the progeny is female arri vice versa. In th- male is 
incapable of reproduction, the female progeny can reproduce t proauce 
hybrid progeny ca'qanisms which do not express the toxic i^en--- and vice 
versa if the female progeny are incacatl^.' of reprodust ion . 

The first group of split -jene sequences 'Grp:) is pr-oduced t" y infecting a 
h::^st organism comiprising GrpL wi*:h a 'Jor.-r>t -■^mprismg Gi:p'3 to prr.duce a 
rransfected 'Organism cc-mp rising at least 1 f r tin each od' ''LrpL ans: Grp3 . At 
I'-ast 1 of the vectors may ke a virus. 

iri (III), the mo<iif icat i':'r. ot th^.- s-r-:nd no-n -functional 
polypeptide ^■:omp'rises g L ycosyl a t ion, p r^-'t eo 1 ys i s , f arnesylat ion, 
cholester :-l esteri f icat 1 : n, acetyl a*: i -n, met nylat icn, p>h :.sphor ylat ic n 
arid/or deph xsphoryl at ion . At least 1 step may occur m vi-o :r in vitro. 
The engineered genetic elements may re cLtLA. 

Itne first enqineered genetic elemenr enc-joes rdctin li^jase jnd the second 
encodes an engineerea biotin depenienr glyph-jsate resistarite 
polypeptide. The mixing step- comprises rratino parental cr-::ariisms 
s-mprising tr.e first or second genetic eLemer.ts to produce r ro geny 
comprising bjth of the engineered g-^netis elements. The first and second 
proteins are then expressed by the pitgeny. Tiie p-arental organisms 



are as uesci'iL^ea abfj\'e. 

Hnst or 7ar. L;-.rn:y oomp: r : sing the f irsr enqirieered genet is eleinent are 
irife:-cei wL+h a vectv^r encc :lsng t!ie second to prodsce a organisn 
c imp r is.Ti-:] h^oth ( al t^-rnat i vei y , a host romprisinq the second rru^y be 
infecte i vvith a vect-.^r c >mf rising the Lirst genetic eiernent), the genetic 
eienent:- iriay then tie e:-:pre;;sed i.n pi'^geny. 
The vector :i s prefer-^biy virus. 
Method ( VI ) fur the I- jc.::um ises : 

!i\ j^r.g the ie-:*L'rTir:a ne i nC'Ti-o^/e t: 1 -itJtJ.Lna gene sequenc'es for at least 1 

d^-sirei piof-erty ; 

(::,) re- :-::(ioinina the i e-o.-mhined n'.-n-.jveriappmq gene sequences; and 

i'.'p.;3-LnG the "t.: n: : e : :a;j lUc: ri on :;t-ps until, i d.;^.:i^ed 

r- ;o: mru Tie :i 'jene':ic eie:aest :s :'bt.; .:.ne . 

Tr-- nc'n- j-.-er iac-p'ing qene s^^- ruer; :e;-; ar e d':M:^\-ed fron a ci/:.5u geiie and the 
g.r. r.u.:l.-M: ici^i se::uenr--.3 ::.:]:e u-:Uo:o'e :i fr-ra a cryll^a, cryl-:-^ and/or a 
,_;-yri:i 'i-Oie. hz 1 e a i:! t i :':-p may :o:cur in v: vo cr cn vitrc). 

ru 'i/ferio i C^^mpC'Sit i i-fis : in :r/) le5:S^; s p::r.!U iat :;.cns c:-mprise noiiK^logous 

i-j'-'Tiet i'.'^ •i-'leinent s . 

Ir: 'V), :i :rans-acting 1 1 an S':'r i P't .1 on f3ict':r rrorn 1 p-opulation of 
er:hanc'_'r-iiriked split"ger;e sequences reguiates tne enhan-i-e r- 1 i nked split 
i"4-.-r.e sequ^n'::es the other, 

Fivferre-l 3y3te:[i: {VIII) f.rtner crmprises sequence search and 
■•■.■r.fi-ui isoi; .-e:: toi searcr^ing t:>r sr.e-ifio:: rs:clei- a :id sequences. It also 
■-■u-iru- is-3 ar: auron.a:ic s-r ifuv-ricer _:..upl':-:i lo an output rr.::'ni the ::jmputer or 
■"U'mtOJter r.-^oiiurr. v;ni :di a-:':ep'.s ^:;s 1 1 u:;t bjn.: 1 l jm th* :-c :np ... - e t" nr :nel.Luia 
which dir.;ct seguen-ing .-f' sequences sele-tei fr-im spirt gene se-iiuences, 
cr:nancer- : ir:ked split jeue se :iuen'::es , t r :ins- . ::t i sg t ra ns-::r ipt ion factor 
:-^"qu^:^n -y-s, engineere-i gerecic eieiri-nrs, n-n- ve: r 1 iipp in':j -^ene s^-^Aeri-zes and 
■I jCi riuv^leic :icids. it m.LiV also ::oni|:'rise ir, aut.om:jtiC- synt. he:^:i zer cjupLed 

-r. jutr'-ut fr-jn the -^o-.puf'rr - j r ■ -C'Tp'.i te r reii.u?. whicn ".c::ept:s 
■ r uc* ~ ^"'ris f r tt-j :cc.f:u*:-£r jt r\f:cli'jm \-.'ii: -rh dir^-cV. syritn-E'sis of the set 
seJ-ct^-d lr.-;m sijlit ger.e sequences, enhan :ei - Id nlieiJ split :jene sequen-es, 
t r ar:s -act m :i t r ansci ipt i'.'S factor sequences, engirieerei uenetlc elervLent s , 
n-cn- vver lapprnq gene sequenc-s and gap nucleic aci-is. (VIII) further 
■comprises I'c'r'more robotic rontr:d elements for incnatating, denaturing, 
hybj idicirig and elongating a set ^'f recombined non-overlapping jene 
3^(i-j^n -Tes ana gap nucleic acids arid/or a detect'ji 'ujl detecting a nucleic 
acid produced by elongation of the set of recomtc ned non-overlapping gene 
sequences and qap nuclei':; acids, or an encoded c^roduct of them. 
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Ti St rej:.tococcus p.neumc.niae fabZ proteins useful for ■■.iiagnosmg and 

treating mi':robiai. infect lens. 
DC BO 4 Dl*: SO 3 TOI 

In KoUSl^AXTIIlIDIS, A; RdSSELL, F 3; WAPFEU, ? V 
PA 'SMIKj SMITHKLIUE PEECllAX tt jl p 
CYC 10 

PI VK' .:00-'^0.:-Oh^'-2 Al lCr>n()601 (ItOOSO;*- Ei; 51p A(51K035-74 

FW: AT BE CH CV I.'E DK ES FI F? 'IP GF IE IT hU MC NL FT SE 
v;: .JP 

AjT Iv"' JiiOiV:' r'^ib^d Al WC 1 -^-UVl ^. 4 3 3 10 J- 01 109 

p?v\i v: i'-^Oo-i i'b.HS io';hi i V' 

IC 1<-A Au:rV:35-74 

I ^S AkIFu3::-C;; At I ^ t - 1 ; C07K016-00; CiJKOtd.-OO; C12N0I5-00; 
CUItjlt-3:; C1..P011-01; C12000i-t8; aOIKO:3-63; G06F017-30 
AB vvO iuu-; 30601 A UPAF : lOf/Oltlt 

K'lVElTi - IPjcleic aciis ^I; encccmg .r. polypeptide (II) 

>:.si linated tabZ, a r:.err±'er ---t toe fai-.ty arid h i^rsynt setic r^athway family of 
proteins f r :.mL St rep-t : ci .'us pT.eumC'ni ae . 

DETAILED DESCPIPTMD' - IND'EPEWl EXT CLAIMS are included for the 
f j1 lowing : 

■'1) an iS'ilateji p-o 1 ynu :leot 1 ae (I), comprising either: 
;a) a pc lynucl'-.-ot i de rcmrrising a sequence enciding a 
polypeptide that has 7C'-05^^. iientity v;ith a fully iefinei 140 



{i:0 a p-jlynu::le-:'ti ie cjmpri.:inq a se^ci ^enc'e thai., has K)-^'--.'- iiientity 
tC' a p':' I.ynuc: I 't i :ie .::eqaen::T eaci'dirig (IIu) ever it^:; entire I.- 'r\qt':\; 

[':.) a iS'jiated p:'Iynu-:; 1 eot i cornprisinq h sequence thiat has 70-95.'. 
identity to a define':! 4^3 naclec'tide £:equenC'E' fla) given in t ne 
spe 21 f. ieat i -jt:; 

( a |:i':d ynu-::le:'t ide v/nieri en-M-iies ( 1 I ) 

(e) a fi :■! ynueleC't i :ie ciraorislnq (la); 

\i) a Li-id yau':;iee't i'ie ■ji:'tiLried r-y screen incf a libiary und-r stringent 
:: :'ndi t i e^rs v;irh a pr^r^e rising ( I 3i ) ; 

(a) an is^jianed r:":d.-/n le Jt ide sequence enejdir:q 5i niatur-^ farjZ 
protein f . pne'am jf; i a-- ; iind^:-: 

(h) a seqaen::e :--:'mp'ie:rien: i i y (a)-;;;; 
\1) a polypeptide seler.ted f r-jir.: 

■' ^ j ^ peptide e-ia.p'ivi s i ng ar. aniin'i' S'^ici =■ q j-^n having 

identity t^-.- (I la) 'jver its en: i re lenarn; 
(!:■) a peptide 'i-crrX'Ti s mg ( I ha ) ; 
( :.) a peptide ttiat is (Ila); and/-jr' 

(■:;) a polypeptide enc-'jaei i:y a ree-jrnijinant ]:'-jiynncle-jtie;- 
c-i'mpri? ing (la ) ; 

( ?d an antib^rdy (III) anti'jienie fc^r ( ha ) ; 

(4) a rrethjd (IV) for ^he t reatrnent jf an individaal: 

(a) in n.e»V'a 'if ennani'eo aativiiy r- ^ x}jr^Ess i. :'n (11), vsjmr'r.i :e in : 

(i) adiTiin.i St ering agonist (II;; and/ s-r 

(ii) [jLO'Ji'air.q the 1 r.'ii vi d^-a 1 witn (IT) z-j ^•io-a-^.z- t ne 
polypeptide (aria its activity! in viv:-; :'r 

(rd in ne^^ci ■: f inhihdted activity :d~ (TI), I'liitjr i sir.g- : 

(i) ^djui n 1 s t er i ng .un :intag':-nis t 'I'f (11); an':i/'ji: 

(ii) adird rii ste !■ ing a nuclei r a'::io :Ti:de'::ale iha: inhit'its the 
expsressi'tn ^'f -,1); and/.ar 

(iii) a-iiniini St e r in':i a polypeptide tnat 'L's-iil;:-^- t e r with the 
polypeptide fc^r its ligonc, 3u;j^tra*:e tr r S'ce^'t ':r ; 

(5) a iTLeth-jd (V) f'_'r vii agn':s i:.g •^■r r'<jgr.'„'3 i ng iJis-^d^e '.^r 
susceptibility to a disease in an indivi'iaai lelat-E'ti tc^ expjrs-ssi:n zr 
activity of (II), C'jrrip-ris ing : 

(a) det eriTii ni ng the p'resenire cr ali'Sence c-f a iiLUtatiS'n in the sequence 
(I la) in the genome 'jl the individuai; and/C'i 

(b) analyzing and quantifying the p^resence of the polypeptide 
(II) in a samp'ie frt'ir. the individual; 

(6) a method {VI) for screening to identify comp'ounds th.at activate 
or inhibit the function of (II), comp'rising: 

(a) measuring the bin<:iing -tf a candidate ct-mp't'und (t'C) to (II) or to 
cells or meiTih'ranes bearing (II) (of a fusion p^rt'tein t'f {11} ) using a 
label directly or indirectly associated with the CC; 

(hd measuring the binding cf a CC to (II) or to cells or meir.branes 
bearing (II) (of a fusic^n p^rotein c-f (II)) in the p-resence of a lat^eled 
competitor; 

(c) testing whether a CC results in a signal generaced h^y activation 
c^r inhibit. iC'Fi oi (II), usirag detecti'jn systeir.s ^P'pv: 'jp^r ia s; e tz- the cells o.r 
membranes i:*earing (II); 

(d) mixing a CC with a scdutie^n C':nt ain.i ng (II) and meastiring ^he 
activity of the pi^al yp ep^t i d*r in : ne irdxture, and ct-iiip^a ring* that activity to 
a stan'~Jard; 

(e) detectin'g the ef fect oi a (tC t-n tne p'r-acucr i on tif mIt!:A enct-ding 
(II) and (II) m cells usin^, fc^r example, an enzyme linked iriiriunos t-rb ant 
assay; and/or 

(f) a metri'jd ctimprising: 

(i) contacting (II) with the CC and assessing tneir interac: iijns ; 

and 

(ii) det ermin in::j w.nether the ::cmp"tuni interact.: 'with and activates tjr 
inhihdto the o'/tivity f (II) L>y detect iiig th- p'l - s-ct'te o^ a;j.- eri'te -a f a 
signal generated frccR trie interastion 'tf tne c^amptunc an:i polypeptide); 

(7) an ( ar.t ) ag anist (VII) of tne exp'ress itin cr aztivity 'jf (I); 

( ) an expression system (VIII) c^amprising the P'Olynucle'_'t ide (I) and 
capahile of exp-ressing (II) when p'resent in a hast sell; 

(t^) a htst cell (IX) ccimp'rising (VIII) c^r a mem.brane expressing (II); 
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■:ix); 

(IV) a prL>ce;-;3 u-'I) tor pr^^aucinii cr ci :"rien:j::)rarie jf (IX) 

expressing (II), ;':)mpM3 i nq t;r aris forming/' ci^ans feet inci a cell with (VIII), 
so tnat the cell -xf)r.-.'sse.:.; (II); 

(1.:) a C':)mn)ut er r';;;a-:i-:il:)le me-:iii.im 'XZD, upon whii::h Js stored: 
fa; (la) or :>5 qu'Vri ::e 3 ) c-nripr i s iri'ij ■■Id) iiiud/or cither pcJ. ynucleot ide 
sequences ) ; 

'oi (Ila: or seqaenc^iMs) c -aipr is vng (TJa) ■' and/ oi otner polypeptide 
sequerices ) ; 

i'Z) a data set representing (la); 
i- a da*: a s--': r '"-rir n r. i n ; pc; lyr.c ;:1 e ct i :1- s enr; i r, 5 : I ] .a / ; 

(I/) a :::jiiLrju^ er teased :n-;j"n^:.ci f>::ii; ^or n^er f jini i n j n:):uaLor/ 
i :lent i f I cat ion, c-"'in}jr 1 c- in a prC'Vi :iiri -j -j sequen-:*e conipr' 1 si n :^ (la) in a 
C'araputer readatjle medi'an^ an;i z^rr.pai" m :.hat segnen-a-c ta aah-ar :a al ypepjt ides 
and pC'iynucie'ijt id^.- to i'ier.tify .;eqae:i-ae h :)n[-jlc' ay ; 

(1-;) a c-jiuputer i::ed nietn.aa (XIV) tar pc-l ynuc ieoa i de asserut.Iy, 
c<jmpri s mg firovidmg a nu':leic acid ■a^:'itip2- i.s i ng fla) lt: a a-aniputer- readable 
ni^diani .^nd S'areen:ng for at lea.::r 1 ^a v*:: rlapif ling re'ai jn h'etv/een that. 
f> alynucleot ide sequen^ae ana a seaani p alyn^clec-tidta seguenoe; anil 

(Iv) a polynucleoti Je (XV) of the lornnala: 

5' X- (Fd }nL- (R2) - (r'Xi) n-Y V 

X ^ H, a m^atal or a lEiadifi-d n^aale-ataae residu-a cr ^a-ie^rier with Y 
defines a aaavalent h'an-a; 

Y --= H, iTietai. or ^ jficai 1 f L- :i n a :.i e'jt ^oe re;:ida.-a cr ta'-.:e:ncr wi ':h X 
defines a covalent r-'jr.d; 

rd and R3 = .jny raa'alei^a a a i :i ie3Ldue or riijiifie'a nucl-.d. c i^aid 
resiane ; 

?2 = the nuoIeC'tade .:-eqaen-:;e ( r ) ; arid 
la Q'i n •■000 . 

AV7I VITY - Bacter ici ae . 
MECHAMIoM OF ACTIOM - Vaacine. 
Lvj data given. 

USE - (I) and (II) may be used in the jareventi 'a::, t I'^dtaiorit and 
diagnC'Sis of diseases ass<aciated with fah'Z exp'ressi^an and St reptc'coccal 
infect i'jn . 

(I; or (il) may be aaministered ^_o treat aiseases by rectifying 
mutations or deletions m a gen^ame that affect the activity c^f fabZ by 
expressing inactive p'rotems c^r tC' sup'plenent the p re (juct i c-n c>f fat'Z 
]:'Olyp:ept ides . Additionally, (I) may be used ta H'roauce iarC, acc<araing to 
standara recombinant riJA. methodology, by inserting the nucleia acias into 
a hC'St 'aell and culturing the c-11 t-a expiress the f r c'tein (eithier in vitro 
L'r in vivo). Antisense nucleic acid m^alecules m.ay te adniinisaerea to down 
regulate fabZ expiression r::y binding with the cells lawn faL^Z genes and 
preventing their expressi<an. 

(I) and coitipl ementary sequences rr.ay L^e ase<a as ^'lok^es in liiagnostic 
assays tC' detect the p^resence of fabZ in samples. 

The pialyp>ep-t 1 des may be use^'i as antigens in the p'rcdua't ic-n C'f 
antibc'dies against fabZ and m as^-says tc identify mj.a(.ii:lat 'ar s C'f fat'Z 
expressiC'n anri activity. The anti-fah'Z antihiodies and fah'Z ant a^gtani st s may 
also h'e used to dc^wn j'e'Talate fah'Z exraression and a^ativity. They may l-e 
used t'"> tre<at S. ^ineumoni ae 1 n f e^at i lans , 

The anti-fab;', ant ah'C'dies ma^/ aloe be used as (a a agnC';i: t i c: agents fc^r 
detect: rig the presence -af falr^Z p'Olyp'or't ices in samples. 
C'wg . ij / 

FS CP I Et-I 

FA AB; Cm::] 

MC CPI: &:.;-l-BOiCs; 3U4-B04M; B':-l-CtlG; B\i4-Ei:lF; B04-F04; &04-E-:^; 304-E06; 

B':4-E08; B04-F0100E; Be4-G07; b:4-N0.-A0E; Bll-A; 311-rL7A4; 311-C08E; 
Bll-c'X3; B12-K04A4; B12-K04E; B12-K04F; 

B14-A0IB1; bl4-St3; BI4-ollB; ri:5-AtlA4; Da5-Aa'iB; I-Oa-AIZB; [-05-012; 

rM:;^.-H04; DC5-H07; BOa-HOS; D05-H09; LiC5-HIl; LXit-HllA; 

re t-H12B; C'05-Hl JE; :05-H14 ; f-'J -H 1 7A5 ; B05-H17A0; I'Ob-HlB 
EPI: S0::-E14H; TOl-JOaB 
TECH rjPTX: ^■QOOIGIO 

TECHNC'LOi^Y FC^OUS - BIC'LC'i^Y - Isolation: (I) and (IIj were isalated frcm S. 



pneumi^niae . 



7E-:HN( jLL':^? F':'^"^US - PK 'TECH:]f''L(:^GY - Preparation: (Ij and (II) encode may be 
prepar e-:! i.>y andara luet n'j'.i-\- . 

Ld.23 I■:J?^'}Er 14 ("'F wrix t'^^'f^vRiGHT r:^ioi :^epwe:it infijrmati*:'!': ltd 
AN ^:a-.i:.-^.4 yU'^ [.][^] vJl i:-: 

DNN Kr auO-'l 0.::a:7 [^]:C (:2aM:;- 1 ol 

'PX p'- ] - Tluc i -i-^c^*: i■:^e^;^ con r^.^ i in :i r.iC' _ '/nc- r'pnic ^^ar iant:"' -jf a j ef^^^r-riice s-equence 
: ,-,] "'tarri': r noorcsio i a r^ -..r ' Ly'C--rf.>^o.: r I :5^:-ful :or studyinrj the 

i -1 ■: o-?i-:-aI function o: l^NFF 1 -.ind rdentotyiiK; druas taio;etir.q the 
protexn loi ir* Eit^:.:: i-i:. 

DC be-; La'^ TOl 

IN i'MEVJr A; IIAIIDABALA::, K; V ?; S7EPHEi:S, a l: 

PA -'ilEV;-!) CMEV; A; :eEi:A-::; -^EUAyA'AiI^E P[1AH1 IN'- (NANP-I; NANE^AEALAN K; 

.y.;Hi"-:i) :"'::H'=ee v P; (:;tep-:.: ptlphens j c 

CYC 

PI V/'") P(";OOC Al ^ C 0 0 i'j "! ; a OO^.-^ ! * EN "Aij;, ■[^a? H:C ' 1 - PP 

KW: AT BE ClI CV DE jK EA EC EI EP C5P ■ ;H C^M CK IE TP FK PN I:U MC MW NP 
OA FT M' 3E CP CP CP UC PW 
V;: AP AM AT A:t AP PA BP P''^ BP. BY CA CH '^N CP 'in CP ['F f'K PM IlE FS 

PI CP Cr. CE CH ilF Ha IP IP IN IC IT' FE FC FP IT. FP PC PF PR PS 

PC PIi PV MA i-IP I-IP; Mr: MN PIW IPI Na N" PP [C Ilj PM :C.i CP :T; pi pp: CP 
TT TM TI TT TP I^A Fa us NP 7F Y' ' PA PW 
ICJ -iCJ'.-CPi'jPPl' A P^^iiaaa;^ P P '"^ ; CyHiPI-^.! 
APT VPI P000CTU4CP Al WO Papa-NC-loPC PajPQP^P; A^l P'OiPCr PPPP A PCJ IPJPl.'- 3 aPP 9 
tpij.j P I 

FCT Al' PLiIipC! i: P A Ba--e.^ '-.a W'.) P pii:'' 'CO 05 

PRAI PC 1 paM- IP 1 -O I^a'i')aP3 

PC I C-1 T CP PCI I - )P 

: 'p-^ A"0NP4I-PI; CIPP■"TI-P^=; ^T P'.aip I- o - ; G06F017-00 

AB VJ;'' PPa..)501 3»-i A UPAP: POoapniJa 

N:jVF1.TY - An rsolared p-:) Pynucie^jM-.ie Oil-/ :.)r :.as ':a.:.n.] )ie:fLeria , c^-npria^ng c 
p.jlyni'jrphic variant oi a refererM^e aeipjeaae for Pamor near-jaia lactor 
recept-or 1 (TNFRl} -jene, aoraprisin.g at least 1 pa Lyn.orpnrsm sePected from 
a tnymrne at PS (p-da/morphio arte) I, PS5, PSC, PS''-^ and PCIO, adenine at 
P::P, PC4, PS7 and loii .guanrne at ?o3, an^i ayr.js^ne aa PCo and PC12, is 
new. 

DETAIPED DESCRIPTION - An raoLated p^: 1 yna :Peot i de (Nl) CT^mprisLng a 
C':Tyn.orphia variant of a reference sequerrre for tumor necrosis factor 
receptor I (TNERl) gene, comprismq a thymine nt PSl, PCS, ?S»:'', PSIO, 
adenine at PSP, PS4, PS7 and PSIl quanme at PC3, ani cytosine at PCP and 
PSIP, is new. The reference sequence comfjriaes derined IP 39, 13P8, '.'r 
Pp54 base pair (bp) sequences (given m the specif ic -it ion ) . 

INE'EPENIiEtjT CPAIMS are also mclaaed for the Pow mg : 

(1) a recombinant organism t rans f 'irme 1 ■■:r t ran.: f ecte J with Nl, v/liere 
tne organism e:-:presses a TIJFRl protein; 

(P) an isolated polynucleotide {[^Pj ■-omp r is mvj .u polymorphic variant 
oi a reference sequence for the TNERl cPNA, wnere tr:e reference sequence 
comprises a oefmea 3PP^ bp sequence (given iri the apec i f icat i-i-n ) , 
C':>mpr ising a p':Tymorphism sele'^ted from .guanine at na:lec^tide 36, tnymine 
at nucleotide* PP4 -.r P69, adenine at naol-ot.iie 36P, and ■:ytoairie at 
n-;cIe--TTde 4 0 3; 

(3) a recoirbinant organism transf::rmed or trata;fected with NP 
-^ixcressin g TNFPl; 

{4; an isolatr^l polypeptide (PI) ramprisin^ a p :) lym i^rpnac 
varian^, of a reference seoyience :cr the CNFRI protein, where the 
reference se-guen-e -omprlses a defined 435 amine a .:i 1 seq.ience (given in 
f_r^^ specif i'rition J , co^mprislng 3 leucine at amnc acii 7P, m.ethionine at 
an. in- acid P^o glutam^ne at amin-; acil IPI and histiline at ai:Tna a-id 
135; 

(5) an :stdate:i antib:dy to FI; 

(P) a a-jm!:a3ition ::':'n f: r is In n =t least 1 genjtyiiang :T igonu :leot i ae 
for ■lietecting a pc lyn.orphism in the TNERl gene at a p :T ym-i^rph Ic site 
selested from PCI-PSIP; 

(7} a method :MI) f ■: r geno-typing the TNFFl gene of an individual 



comprioing (ietermininQ tor the two copies of the Tht'P.l gene prei^ent in the 
individual, the ident...ty of tne nuclectide pair at ..r .least 1 ?2 selected 
:rom E\:-1-PSIJ; 

(8) a methrvi (Mi; fox. haplotypinc the TNFPl gene of an individual as 
in Ml except that the ident...ty r^t the nucleotide at the PS is fc^r one copy 
o^f ttie TNPld qtM;e; 

{'-^; a rn--tlnj'i {'A i) tor pr^vd i c ■. i no a ha}tlotype pair for the TNFP.l gene 
of an indi'/L'lucil crnprising : 

(a; identifyir, a TNFPd. Lr^-n'/^ype for *:he IndivirjuaL at at Jeast 2 
|o:. Iyrrir-rp>hi'-: sitt-s S':d.ectea fr':rfi TSl-E't'li; 

(ti.i enuirierar: iri^'j all r:>ossilde hacd^j type piaiis v^nich are cc-nsistent 

( a'jcessmg oa^-.a ;::^ntain..ng the vNPld haplo-:y;:e pairs aeter:tined in 
refe-rence tHjpul a ■■ o-'n ; ^nd 

(d; assig::ing i hai-' 1 ^'t ype r tc *:h^^ indi iciua 1 thai is s-ansistent 
'.tith th^- era; 

( 1 i) ) a inetli'i'd (Mi) tor ia^rLtifying art asscc"..ati':ai L-etaveen a trait and 
,-it least 1 'A^rncrypo or natd ot yir^e ot tne TIIPPl gene comprising ::omparing 
t:he treguency o: tne geneiype '.-i hapl'::ype m a reference ]>jpt:lat i tn, 
v;here tne c;enotype •:.'r haplotyije n-^jmpvt i ses a nuclee'tice }jair ^tr nucleotide 
located at PJ'l-PSli, v;here a :\i-jh^r frequency e^i a -genc'typ-e tr haplonype 
m the "ram t't'j^'U 1 t: i ■ jn ^inar. m tne refH^rence p-c piuia: ie^n indicates that 
^ne trait is a ss':-c i ^ ed v/itn tn^r ^genc'type '::-r h ^pc yre ; 

[11; a ^>:nip'Ct^ -r sys^-MT; f'tr s^:c)ring anc .:in ^ : yt ir. ^ t'-ji yiao r p»ni srn data 
Z 'jr t h'i TNPPl a^iOiO : O'lnp^r is i :'ig : 

(a) a '-rentrai p-rc-zess 1 ng unit (CFto; 

(0); a ■"tjirjaunic 3 1 1. e n inteifai:^; 

(::; a 'iispliy de^t:ce; an^i 

i :{) a d -iit ai:>.::S*i sre^n t u i ni n^ j tne p-:d yrio:ra:i! i sm '-ia^t^; anil 

(11; :[i^th'i-:i (Mid fjr S'.-reening t'jr ^irags t..xrgeting TNPRl isoforits 
■ tjinpid s . n'7 'C-nt a :■ t ; ng ^ he TdPld i .-•cd ■:■ i: n w;'d: a cundiaaie ^^gent and 
assaying for bin-iir:g jctivity, wr.-re the reterer..:-e .sequence and 
r;-oi lyiTi':'rr:'hic varian-. s are as describee ^n [■]] . 
ACTIVITY - None :;iven. 
HETHAniSM OP AoTIijII - IJone given. 

- L'et ermination ot the p'Oly]Ti'ort-)hisr[.s in the TMPKl gene is useful 
for studying the bi<tlOigi ca 1 function ot T!JFRI as well as f'^r 
identifying '.:lrugs target irig the protein for treatment e>f 
disor'iers related t its abnormal expression cr function. 
fjwg.O/S 
FS CPI EPI 
FA A3; DCM 

MC CPI: BOl-BiVsC; B04-tt)IG; BOl-EOjo; P.04-E03rj; EC-;-F04; Biil-EO^:; B04-F0100E; 
B04-C')l; B04-Kt:)IK0E; Bll-CdyE; B12-K04A; B12-K04E 
; B12-K04F; D05-H09; fiOS-iill; T'OP-HliA; 
C'Oh-Hl^BI; C'OS-HlJDl; EOS-HliDt; Li:)5-H14 ; Dn:.-H17 34 
EPI: TOl-J 
TECH UPT:-:: jijOOlOOb 

TECHMt'LDCY FijCnS - B lOTECHH'jLOC V - Preferred Polynucle Dt ide : Ml comprises a 

TMFRl isogene. Nl i.s DtIA and further ct-mprises regulatory elements. 

Preferred ^Urganism: The reC'jmbinant organism is a transgenic animal. 

Preferred tlnmpos i t ■.. jn : The gen':)*-cypirLg '^lig jnucl e^jt L':ie is an 

a 1 lele-spec L £ ic 'Ol i. gonur:leot ivie hat sf^eci f ically nybridi zes to an allele 

t'f the TNFRl gene at a regi'jn containing tne pr>lymorphi- site. 

Preferred Methoi: In [-11 and Mi thie ':iet-rmininq st.-p c-amprises: 

Lsoiatina a nu.:leic asi'd inixtur^- c-.tnprisinq both copies tf t:he TNFRl 
■::ene ; 

d::)) amplifying a target regiDn -antainmg at least 1 of the polymorphic 
s ites; 

■::) riybridizmg a gen-jtyping o 1 1 g-jnu slr^ot i :le to orie allele of the 
■rcnplified target region; 

[ i) performing a nusloic acid tempi at e - :lependent primer extensicjn reaction 
on the hybridized genotyping o 1 1 gonicieot ide in the presence af at least 2 
iifferent terminators of the reaction, where the terminators are 
■:-:omplementary to the alternative nusleotiies present at the polymorphic 
site; and 
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In [-14 *:rie trsit :.s a :;lirii::al re:-;p-)n3- t.) :t drug targeting TI^JFRl . 
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^-•y-^i-.ni [4:-: v^pix 

]-M.?44';^ [2^P; /0'iO-43/3»P' [4!]; 2-00-M4o^):' [41] 

:],Mj(iij_ :>,44.:;>-. jtic 14 h,!.. 

J I I i i i C'".) r'-C'jiui:)inar:t nuolei'.: a:i.d pr-pa rat : ofi oy gen-^^Lj. c -alg ji i tnm guided 



■:-^ri'^ ;~ yrr h'-' ;? : TiV't i '/-;s pr'-i'/idin; 'i nunio^u- -^f p.jrent";] 
f tr^jv ij.l.ri g igC'riuc i e^it Pile.?; ai\d 1 'jri ga': i n g tnern. 

iv^ ; P: • TOl 

PFP ';:ARPAPr',F , S; C'CTAFP-P JM , C; MT !I.:"H"'PP, J; Pr^TTEN, P 
. . i'H]!4MEr., W P r'; TOPIlJ, i-I 

c4A/Pi'-i:) MA:\ycFn riic 



'ha ra --r f^trings, 



.1 F'PN< iV, 3 A; 



: :.A/r.' 'y)-F '1 

OH r-M ^:;r if [t ke : 



MC MW NL 



■ IP KP K?. KP PC :.r; lr ls 

P':; PI PK sp 
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00042560 
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17:' P 'JP'.''PR: ';iO A.: 2': ■■^ '■'':■! ^-'j (P';p\^4-'}' 

F.VJ: AT BE CH ':Y jE lj?: EA FP FI : 

fjA ^T Sb PE CP CP TZ PW 

W: AE AL AM AT AM AP BA BB B'P BR BY CA ':H CP CS, ..:p i::z [lE PP PM EE E3 

Fi GH CP GE CH CM HR iP.l IB JP Hi IP 

PT PC LV MA [4P MC MR ICA MVJ PZ 

TP TM TP T'l' TZ 'Pi ' 'P> -'P ''P ZPI Y'' 1_>P. 

A'; Ipj'j'YjZP 1 lii A PP'YjiPi^pl (^pP.iP.;) :;P0Pp1P-P 

F;: AT BE CH CY L'E C'K EC Fi PF. CB CR IE IT PI PL' PIO' IIP 
WP ZP0P04PPP0 AP WC' POOP-PSl POP PP'":"ipn3; AP PPPpPpIlpl A AC ZpOP-3.^101 
P'"'L' I'O 1 1 o ; EP l'IPZ014 Al EP P r"!.^ Pi 2 .' ■.■ 0 1 .o , W'l .. ICP-'JC! 2 IP 
AU POuOOZPlOl A Based ori W'j p0u042P»I0; ER 106PP:4 Al Based :>n WO' 
IiP p0P0-4]C:::B7 Pi^P^PjlR-j; 'iP 1 PP'^- 1 1 •.■4 4 I ;^ ::^0 U P ; CP lPP9-n:: 

\ -I'liMJ'if'f';- • If;" 1 1 1 I ^ '-^ ' 'P ^ ■ " ; ''I" : ^'P'-*-": 5 RPR;' 1 " 'I* *P'! 4 ; ' 

l'-^':^'/- ^ly? :\'_^ 2 IP 0 P u 9P ; ijp I P 4 P ■" :■ ^ :' 1 0 3 .' ::'.P: ; P.-" 1 P 4 1 ^I 0 7 I 
IR99101^; tPI 1 P P P- 4 1 :■ 7 "! 0 ;^ R.C I ; I'P P 'I P - 4 1 c 0'R> 2'":IPiPIlI 
ICM G06F019-00 

VJOj P 0 0 0 4 I: P 6 :j A i J F'AB : 2 0 (J 1 C) II 0 

npiVELTY - Rreparing rec'jinbinant riU'Vleic acid (I) zrord ^jl ig^jnucler-tides 
wni'_h i..<_'ra *rr^i:Hjriiu LC' ^ s^l •jl lT lax act ei sti.Rig s uL'^eQ ^eni. es (PCPS, 
■:M>mi:'rising at least two parental characper strings (PCC) oC'rrespondi ng to 
a numt'er of nucleic acids, is nev/. 

DETAILEl) DESCF.I FTIOR^ - Rreji^aring recombinant n'U'.rleic acid (I) by 
aligning tor maximum identity a number of fia rental character strings (PCS) 
corresponding to a number of nuclei c acids, defining a set of character 
string subsequences (3CS3) cd'Tupa' ismg at least twc* of the PCS, cuoviding a 
set of C'l igonucleot ides cc-rresp'Onding to the 3C33 and thien annealing and 
elongatirpg one C'l" mijre ol igc'riucler't iaes with }.■<:■ 1 yidei iise or li gating at 
least two with ligase to p'roauce (I). 

IUPEPEIIDENT CLAIM3 are also incluaed for the fc.Ilowing: 
(1; forepar mg cnaracter stiinigs ;C3j t^y f'rL-vioirjg R23 -E-nrc-omg a 
p'jlynucl eC't ide or polypeptide, p.roviding a se\ o[ 

C'ii'g^'nucl eot ide character strirpgs t^f j: reselecteLi lenrtn that encc.fie a 
riumh'er c>t." smgle-st randoii 1 1'gL'r.U'i-le'r t ide seguer.ce c-jitijiu' is i ng sequence 
rra^gmerits oi PC'S and its ccmpd^rm^^nt arid i.:iejting a s^.^t i^f derivatives of 
p^ar'-.^ntai seguen^/e cC'iT.pr i s in^g se-guen^i-e var iarit strings, >.i sot jf multiple 
mut at ii.-.'ns with L-ne m.utaticri per variant strm^j; 

(Pj a ilbrar-y ra^epared hy the ah-cve said m,e:h<:d; 

(I) f i 1 i t at i n- r rec'i mt*^ nat I'lri pi-vt'.veen tw:^ -^r rTi-::?re diver-iient nucleic 
acids l-y aligning PCP corr esp-i-ncin^g tc. divergent nuc:lei':' a:::i!ds, 
Lleritifying regions of sequence identity .jnd re^iic^ns C'f se^quence 
diversity, defiriina a dipC jmat '73 vjhi::h is i nt e rme':; i 'it e in RC.7, 
syr.thesi ::ing at le.ust a p'jrtic^n of the dipljm..-it ie'iru-jn-:'^ tc pr-i-auce a 
dipl'jmat nuclei-;- a-rid and recc^mPdning a mixture zf r^irerital nici^jic acid 
and diplomat nuclei? acid; 

{4} a mixture of selectea rru^Ieic a'.Mds pr^raured oy the at'ove said 
meth'jd; 

(5j generating and reC'jmh'iriing nusleic acids ty irij^utting a number of 
amino acid sequence character strings (APC3) int: a iigital s^'stem, 



L^-j^ise L I id I ii:o A^'Co 111 the digital 3ysto:ri to a riuraber ol nucloic dcia 
charactfer strmq^; v;hi are species r-odi::") oia:7foi m sfd e'-:"" -oi e:-:pre5s iori 
host and with opt iini z-.^'i sequeri-je s i mi I a r- 1 1 y b'-t w-eri a number of ni..]cieif:: 
acid ::h.;lr^^ :"ter string:-; and synt [le.: i zing one or mt re o 1 i gc-n ao i es t ides from 
or.e or mi'ore r'ever'Sv:- r'ans iat ^rd nu::leic a^'id e':3ueni":ec ; 

f ^d ■ 'H't i m i z i n<;; a :tivity if d niolei'V aci'i by pa t antf^'-^ r i z i r.g .j nai:il::)''.'r 
■: f riU:il^io a:: ids or proteins to proo^ide o set 'jt 

ir.uit idimens i'jnal da tajr^oinr s , ext raco'Latd.ng on^- or more pMjstulate'i 
iT.ul t i d 1 pens i : na 1 -la t -ir": in*: trsm t r.e set. '^f iiio 1 1 idirT:er:siS'na 1 datapoint^ and 
:cr.verMr;g the josstuLitcd iTi^u 1 1 i : ms r.s i on.^ 1 cia^.^pidnt to a new CS 
sC'rresp'irL'-ii rr:] t ■:■ a po.-taiateo irioisis: ao.id or protein; 

an iriL^iai iibr'ary s-: : least ah oat lo^:' re':t: irb inant ntr:-d.'r L'^ :::tio-:-, 
:-smp;iisin; jz i-Oj:::- .ro-^t IJ"; diifsrent n tn- i ^i-n : i t a I an it 3, s yr^r i di z irs: 
:ne iil-rary -a: ins- s^r ir.t'r'e ps'T^a La t i':'n t-f na';:ieis: aoias tt.ot at r respc n<::i t3 
snc or itLae su::iseqaen ors in th-:: aiffertri\t lirooiy units; 

) r;ne enricho'd librar;^' prr..ia:>E;d r^y tne ab I've saio rr.etht'd; 
(o) v|ert':'i at i nt: a lihrary ^-f hatds-gicii rod yrr^^E-rs by generating a 
divei S'/ p'S^ alat i'ln :d Cc m d ^-oiaC'iter, wnioh m ta:n dre gen-rated r^y 
altera^ i<:a: :d pr e-^ia-: i st i ng t^s:, synt ne;oi z i ng tne tiivsrse ptpaiatisoi s-f Co 
in wni..:i; ':iiv-tse rM.'{_a i d t i ori os^ir.r- 1 i s -rs tne Lir-rary ot iaisLiaicdi r'SlyuLors; 

{]■'.■• ^n i nt -E-g roi r -a st'Sts-n. i-c^rrpa i s i ng a -'-orirv : t: r hovins: a fir.^t oata 
::et o :i\r risina a tirs" .== ::rita Oat3 .:et <-omf: r i i n':] a ;:ee'.a;'i tt3 , 

stftwEir':' for aligning the first ana seot'OO C.-,.., .-sdhware fsr r^er- f t riro. ng a 
gene^:io c^j/e r t i ■: n to; the first 'tr sooso.o hid ari :ot{tat tile otmr'r i s ing a 
tnirtl 'Uat s-t osinr'r i .:■ i nc a t m roi og^ third 'h:* oC'Itiio?: i. s i ng a:: 

-ai:s-g-o:n zos frtii. ^n-;- firs- an-; /^s-Z'-'Ss: tth: , and !■ 1 i 'o: na r I est i d-- 
.OE'ga^T:■--- ■■'aoi:'a*: f i br ■'■: in]: r i .r ing a rhiraiivy :f s^osr i u:.}S}: i rig l^i i gS';;ao i ec;t idc 

VsE - Tr.s- rt.etr. :-a is -^sef^l h:-r r.;^^di:i evolution C'l nuol^-is 
d:-ids .n vitrS' ana in vivs' ana r'Otvioes t ji generaticT: ot enct'de-:! 
it:de.oa_es wi^h new hvoi/or iniprsved rjr s-p'Or ^ i es . Proteins an<a 
naoieio a zids of i n^diist ria 1 , ag-r i z a 1 1 ar a 1 and tneraiteutio impt^rtance can 
be created 'Sr im.p'ri:''/e'.l thrs^agh hh'A snaffling pirc-zedur es . 

Ai.iioAt.d A'.-.!!. - trTs^asical aoo>^ss ro genes t>r SirginisiTiS is nsit regaired as 
segaence mft^rmat it^n is ased f':-r design ana selet:tis'n ot tdigt'. H::<tensive 
sega*;^n'ze mf t^rm^it icoi is prs-vidc-a ansi se:ra':-nces frsiu i nas^ce ss ird e , nt^n 
zaltivahl- s^rganlsrus zan also h-r used. J'stjaen'ses frt'ir loatntgens witnsat 
actual handling of ratnt-gens ana ail tyr^e seguences inc biding damagea and 
inoonfdet- genes are ■umenatd.e t: this metnS'd. All genetic s-perattrs and 
zrc^ssC'Vers zan te fully ana i ndep-en^dent ly ■zontrsllea in a reprs'du z ird e 
fashvcTi rems>ving human errs^r and variahdlity frcir. pih^'sioal experiments 
with C'dA Jianir'uia t i c-ns . ^'s-auences with frame-.Ehift iiiut dt it-ns are 
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od.d furtt.er i ni^olv-'S synt nes i s irig tne 
pC'Stalated nu::leic icid or protein, analysing principle 
z:.mponen*:s :> f tr;e zet E^f :iiu 1 ■ i :ii mer.s i :oi a 1 dat.a pEunts, 30iuffiin;i "he 
p-E'St ulated nazl^:'iz at:ia vd.t.n ar. addi.:i:)nal nuclei z aci.ti and p a r am-^ t e r i z ing 
t.ne set ot ri..cleiz acrz:s :)r proteins by czu' re la t : n :f eazh res i die 
ti' a matrix of rsameriz indi z a t ■:)r'S . Tne mati-ix i£. graph::: ally represented 
as a t et r ahe-dror. roiviri:j an assigned tirigin at its center witn eacn zorner 
representing a numeral v;ith eazh residue being p:)3itioned at different 
scrner tnereoy produzrng the matrix tf numeric in di cat :u"s . bach 
m.ulti dimensional datapoint is correlated by partial least sguare 



, J- J e t i C' I'l o to 

1 elat ic:n3hif ' botween a matrix of !"iop'ei .dc^nt Y varial^les and a luatrix or 
I -rt.Miict'jr 'A v,;i r iab.^'js . Each mu.L t idimor.s.. jnal dat:api''i:.t o-uuprises orio or- 
jMjie dif ferent par<-ii:ieto rs that are p.Lc^t i- •= (i again,?*" e^-.c.Yi '-thei' in n 
d iriiensio-nai hr/fjerspoioe coinp'rising at: le.ist ^'-n^^. din^-n.:ion fc-r eacn 
:ouram-ter. P.-'tMi i at i ^^n t-f nucleic acid i;.. fined to ooLio substrate that 
■ 'C-riijr i ses -o^unn matrix iriaterial an<:; ..i riU'CL^^ic .:''lu ::h i p . Initial 

i. ibiary is {.ir-.-duced loy re ::^.'ml:'in inq one <.u' m-ur-- fior; !■ 'gous nucleic a^'jrds or 
r y rir'tvioincT r'rtS a:ii': pr c'V^iding a set 'jf i - :ijnuc i ^-'''t . ties fc^r e bjnaatinq 

' h^'Ui v;: th } ^ob/ihera s ^ . 

The integrate^! system further cranprises an I ^gonu'.' L- 'jti-ie syntl-esis 
r .::;-n.:-:,.* * for ru-r.oho-:-: i z i ng 'jv-er iaf'pi ng r.U' ' 1 eo- i den , n .:mb- r <)t 
■ l.^T'-'nurie't" :'-u:u3 er.':-o*-ied by over iapp.^n'.: tr i cc^nucnL * ^o . des and v;hrch produce 
":ri>r ':nira rrjci^^rt: a'":-ia err:''jded tyy thr.ru ZS \/nun i n'.'ur .n teu .in (jue 'jr uu„ue 
.'ycios t' f chair, ext ""i-ns i 'tn anc a prc'urai'i \vi.t:, ,in 1 n r t : tt i to ^et ftvr 
..ip}.dy:nc one laore -genetic 'tptrator t the trrst or ce-tnd Co. Ihe 
tp'orator is a :tuta^:itri, a rr.ul t if>r i cat i n , a t ra-jinen t t ic-n c. f th*.;- string or 
strings, a cr^r-ssover r.-etween trie or more strings, ligation of strings, 

ii. e.itism 'oab-uia^ I'tn, an ai n:jnrrient, a ::al cuiati t n :-f s---guence n'tm'jlogy 
t'r sequence s irrd lar 1 1 y , a recursive us-t -yf ■■■no or nijre g-^-netic o^f'orator 
t'jr evolution of C:>s, rarvii'tmn-i'Ss , a deietior. riutatiir., an 

:r.ser^^-on iriutation, and le.nth. 

yF:Cl[ht)Lt>GY F'jCi'S - hit'hfjdi - troftrred M-jth-ci: Co :::-pTises c ne region of 
Siird iarity vdien ali-.;n-tl f'.-r nax^raum ^tientrty. At i-^ast t-ne PCS is an 
evolutionary '.u lirtifCsiii in" eiite-'ria t- -.u-iC ^ r se. u-c-' :: ^-^ a 
designed nu::leic acrd v/trsh represents an energy rnnin:dzed design f'tr an 
•.n:o:ied polypeptide. 

ine raetnoHi corip'ris^-s -iiiC'cd v' mg -jne t^r rl'.: g-nr-t xC it-.- r' at ■"■r tt' o:"iO or more 
FCC-?, or one or rrore too. Tii^ gen-jtic op^'.uejt'j: -s: 

':■ 1 riutat ] nLu:d. ltd icat ion tr f r a rrrr.ont a*: r tn or tte o:.o or :ro:ro. i-'CS t^r 
t ne t r tncre tt:! subs--;- qu-E-n t-cv ; 

[2] a crossover netween any :d" the one or rn.r-^ Ptd.d r' j:te ■: r m:-ro CS 
s u;!) s e q u e n '.t e s o r an a <:iti i 1 1 1 ■ r . a i ■ t C ; 

(3) a ligation of the -tne or more PC3s t-r t'rie <tr irrtr- Co sui: sequence; 
{4) ari elitism calculation; 

(S) a calculation or sequence homC'logy tr s-.-guence siniilarity of alrgned 
strings; 

(G) a recursive use of one or mt;re genetic 'tp-erator for evolution 
of CSs; 

(J) application of a randt^mness operator to tne on- or more PCS 'tr the one 
■tr mc're Co suosequences ; 

:; f.; deletion, an insert I'tn or sut't ra^zt ion mMtatrtTi of C'ne tu more PCS or 
■tne 'tr m-tre C3 subsequences vvitn an inactive sequer^ce; 

(9) selection of the one tu' raore PCSs or one -i^r more 'tSs or or.e or more of 
CS subsequences with ari active sequence; 

(10) or death of the one or more PCSs, or ^ne or m^ire of Co subsequences. 
The method further involves selecting a dip! :cLat s-.-quence c-tmp rising an 
intermediate level of sequence similariLy oetween tv;-. or mjre Co. 

'ihe recombinant nucleic acid witn a desire i ^a t-peL ty is -elected, 
recombinant nucleic acid with one t^r more njmologous nucleic, acids or 
oligonucleotide uni^s for desired trait 'tr :or:jperty is s-Le::.:ted oy in vivo 
t-r in vitro- seie:ti'jn assay, t*r by par.^llei stJid phase assay. Recomh inant 
nuclei'-: acid is :ieconvoJ.uted r-y sequeruing ■■ir digesting, sequenced, cloned 
-■■r synthesized by errcr-prone assemt'ly frjiyrierase chain reattion (PCR). 
Preparation of Co further rnvolves reiterati ve shuff.. ing or selection of 
library cf rezi-moinant rracleic aci:ls. 

Cenerating and recombinmg nucleic acids furtner involves f t-at:mentation of 
one or mc^re elt-ngated nucleic atids, hybridizing tne sec-tndary fragments 
with each other t)r with a set of primary fragmented nucl'.'ic acid that 
encode amino a ziti pjolymer. 

Pre-existing CS is altere^d by rec Drribinao i on . Biox-jgi :al polymers are 
selected from nuzleic acids, polypeptides and peptide 

rrucleiz acids. Additional library nr additional set jf CC is filtered by 
subtrazting additional library or additional set of :S with units of 
biological polymer library which display activity below or above the 
desired threshald. 



t'ref ^rrrea 01 1'^onucleot ides : The s-.it ot o i :|onucier,t ides comprise a number 

■if ^)ver 1. ippin 3 nuel-iit ides Si. "3 is d-^firied by sei-iting a length fir 

I'S and subdividing at least two it th- ?' ;S inti) segments ot selected 

ieng-.h. The set of ■:) I igonuc lee t id-.-s is provided by synthesizing the 

:d. ig-:>nueleot ides Id conprise i-ne '.'r m-jre modified PZS subsequence. 

Tne .-et jf s ingle-s t ran jed il i -ion- 1 :l-:-ot i ies t:nat c^rrespind t j SCSS are 

p-:)ol-d, hyori'iized and exten'ievi a-.- d-j' ib L--s r a ride' i nu:;iei:' acids v;nich 

■.'■jinprise sequences trim at li;r.st t-\-;o of ih--- PCo . The joubie-s t rande :i 

r:u':;l':-ic i^ids are densiturei t :■ pr-:.iu:>j a hi- te r :>genous nuxiiie of: 

s i no . e-s t randed nucleic icids whi'-n uie T-hen r':?-hyr)r Ld i le :i ani -j^niended 

v; i t h p'j 1 y me r a s e . 

Tni' .--e-: -jf ^ 1 igonu i.. e i^t i ie i-.. i.yn- ne.i -.ze "i *: a ;i:irLp r ..s>- ou-^ ■)>: :ujm- :[[-:iiified 
P'"o uri'Se quen :es c Diripr:: .3 ^n ^ : 

ilj i replacement, deieiion, inseriii'U, in:Lu3i:)n -jf ie:;en-^ri"e sequence 

i rie oi mjre I'andC'niiy -'.'r n-jn - 1 m i<jid y -eieii-cl r-os... i i : ri;: ; 
[2; mi las Lin iif c3 fr^rm a seC'jn:! PCS subse :t aeri :;e ii ^ne >r :n:ire 
p:si ■ ions; 

( :d a PCo subsequence C'lased Lv^sO'.J urMa; its frequency in .i selected 
library ot nucleic acids; oi: 

(4) PCS subsequences ci'irif ^r is in^j one iu~ more a r t i f i ■:: ia 1 ly includo'd sequence 
:(!::■ t. if su::h as an Il-linkea q I yc^'S yl ai i i^n seqi-iince, .in ("»- linked 
qlyC'jsyiat 1111 sequence, a prMeas-; sens i*. se^au-nce, 5 c :d l.:gen.ns^:5 
sensiti'/'-- sequen:e, a [vni - ie[ji'ndent *: r ans - r if)i i '^na i ^ rrrd.r: ii^n seq-jence, 
"in PhA seiondary structure 3oq':.o:iOo ■:hat err'i.'Cts :he f f ic j -^ci :y o^ f 
t. r anscrc^f.a: PlJA seci/n^-iary S':ruc*:ui:e S4-quvr.ce tn.;c: ^ftO'Zrs ^oie 

efficiency of translation, a t ranscriic tii^nal enhan^jor sequerr:e, a 
" ranscr ip't 1 iiial prcmiter se-queni-j jr .j tr .uni irif: t ^ona sil'jni^ng seiquence. 
The ■: 1.1 giiracleot ide set ■"iDntams iiio ir i::Oi-- alterel :r ie :;on'.5rate i 
:."::-s 1 r 1111s ci-mpared t-i ^nd ■;-'i!:i}- ^•l 3*"'S a ::h i ri :: nu-lei.c 1:1! sequence 

■and -i;! lirast ■me oil 4ora^'i ^eotiae un^t wh-ji*- s Jijseq icn :-es ai -5: separated by 

■iti"'SS'-)'^er □■mm whicl'i is r-o^-- -^'^ i idii-i^ i f y in :) -i !iumt:;ei o :: PoS, 
dlignmg tne sut'Strings ta 'display f)airwiS': identity r.eti-jeen tne 
substrings and selecting the poi.nt v/ithm tne ali':ni'-.'-i sequence as the 
■.-rossover point. Tne crossover c^omt is sel-ct-'d lan^li'mly or non- randomly 
by selecting approximately m the middle of cne or m-ire identified 
^■airv/ise identity regiiTis. The metho'-i further invol'-z-L'S adding one or more 
oligonucleotide units of the set of ol igoriuclei'tides at a cc ncent rat ion 
hig^her trian additional ■! 1. igonu^: loot ide units and in'.:utat ing them wrtn 
rec'imbm^irit nuclei'7 aci'l and p'0 1 yis- rase . P:e':ombinanT. r.ucleic <icid is 
denatured, contacted with additi<:'nal nucleic a:ids ^'r^-.duced oy cleaving 
P'Urental nu'rleic acid from PCS ly clnemical ivleavag^, using I'Uase i^r 
r e s t r 1 c 1 1 o n e ndo nu c lease. 

The oligiTiucleotide set comprrises <me or mciie 'jlig'.i:U-.;leC't lae units of 
iO-hO nu'.^leot ides in length. 

Preferred Protein: PCS encodes eryt hr ipoiet in (EP'j;, insul in, a 
peptide iiorm.one, a cytidime, epidermal gro\7th facti'i, fibrc'OLast 
qri'V/th factor, hep^atocyte groivth fai^ti-r, irisulji.n- 1 i k*:; growth facti-r, 
mt erf erin, interleuhms , ker at ino-ryte growth f acti r , leukemia inhibitory 
a'.-:ti'r, oiic.ostatin M, PP'-SCSi' (uncefined), p-lateiet deriveLl -qrowth factor 
(PfH-^F) , f.l eior ropi n, SCF ( urnJef i ne^i ) , c-iit L.gan'.i, ':KGF (undefined) , 
G-CCF (undefined), iiicoLfene, tun:':u- sui:p)rosS'ir , sten.iii.i hcrmc-nc^ recf?f)t or , 
plant hiirmi"tne, dj.sease .resis-^ au'i-^/ .;ene, hert-icide : * s .: s tan^oe qfuie, 
L'a'::t r-i 1 al geri*-, mc ni^jxyijenases , protease, riucieas-' lip.-ise. 
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TI r '^'C'lmbming homolC'gous r:ucleic a^^iTis to fioducM f3:;LiJy shuffle nucleic 

a-jids comp'iises hybridii.^ng and elmgating a set :: f:umily gene shuffling 
':lig(jnucleor iides armi pro'/idmq a p -ip^ulatic-n of re c<„'ml:'ined nucleic acids. 
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n:.cleic acid sat^seauen^::^ ■::':■! rear:* :.-ndln':: *:o 

; r^ :;.'.::nr.>in^r.g n-jrriC'iL-j'ju.r r non-homologous 
nucleia a-jld sequ-n-::-ri havj na io\-j ae'Cruenae siniliarity :M.TTipr i aea 
r ■oirti: ' 1. r 1 i ri q > :■ n e Ci r m ■ :> r e set - : f f r a r^m e : 1 1 e ^ i n u c i i ■ 2 a ■ :: i d s v; i r hi i set of 
crc'SS'jver nu'rlei'i- acids whicFi i ndlvicualiy C'jmfi'r i se a riuiriL'er I'f sequence 
diversity a'jii.ains tFDD] ^-'ji r esporidina^ 10 a nuiuh'er of 31"'l' from r-jiTL-jlL-qous 
or non-homologous nucleic acids with low sequence 
simiiarity and T:'r':.H.:iU'2inq a r-c: nird nant nuclei<;: a':v:d; 

(3) providinq an oliq'i'nucleot i'de set for re^irc-.mrjinin'j h<MTi':'T':'q.jUS 
nucleic acdds comp'rises: 

{1) aliqninq a number of (I) to identify one or more regions of 
sequence heterc-genei ty; and 

{11} synthesizing a nurr-h^er ot 'different 'jl igonuL-lcit iae units 
cr.rr espc^nding t'^ ^^t ieast or[^r of the regic^ns of heterogeneity t^j tMOvide a 
set of C'ligonuclettides c>jmpr i s 1 ng i re^jion or sequence net erogerie ity 
C'tr respond in^7 t o ( 1 } ; 

(4) famiiy shuffling F':B. amp Fi ■:; tris c^tmr^r is i n^;! : 

II) pr'jvi'ding a number ''.'f n tn-nc^m^^gene-jus h jm^ob:.-gtus temrFi ^iie nucleic 
acids; 

(II) p-r.:.vidir-:: a number ot T'CR puimers wrii'iTi hyr^riaize to 
n^:r:-hi':.'m':.iger;e':'US Fi'jrTiC'i'"ig:.'US t^mr.'l.jte r.ucie::: i::i':;s 

(III) pro-:iiCin^g a riumrer ot PC^\ amp)L-Ct)ns )?•/ F::F. anujl 1 f 1 :at itTi ; and 
( :v) I e'::ombiriir:q amp-li^Tons *: j r^rovid- re ::-:>mbi n i;nt nu--l'-i^:: acid; 

(3) rO'::<jmbinirig a numt'er of parental n'iclei^:: a^iiids -ojiiipi r 1 s irig 
lij.:Atin7 rr -l:-n;iatin^ a sot f numc^er jf d 1 i ^1 jr:u tie 1 1 1 :ios tomprisin;: a 
numoer of nu^tleic aci'i sequentes fr^tm parental riutleit aci:ls to produce 
rec':)mr)inant nuclec':: a^tids eiic-jiin;! a full len'jith protein; 

[o) rirtdutin;: a rec jmr- : n ant nuclei:: -iCid zc>:i\or is irrj ; 

(i; transiu-L.ni: a population of cells ;;itn a set of t>-te r lapping (II); 

and 

(ii) c^ermittm;: rect-mb^ nat i tn to otcjr betv;een tne set jf fll) and 
one or m^tre r.uolei:: a:: ids 'V-jntairied v;itniri the r>:)puiation ot tells to 
pr^Dvi^ie a p'-puiation of re^t^tmbme'i nuclei t aciis witnln tne resulting 
piopulation c>f recomriinant -veils; 

(7) a p)-jpulat io'n of rec^jmbinant nuclei t aciis and rec-t^mr)inant cells 



( J an arrL};>lif ied nucleic acid (III) pr-jduced t) ; 

a cell, a vectc-r, or a virus produ':ed by the ab'i-ve said method; 

{ ] '■) ) a ':;i:iapc>3 it i'jn conip'r i 3 Lnq a library of -^cl i gc^riuo 1 eot ides 
cc^mr: r i.oi ng a numtn-jr •:>[ cd i crjnac Loot ide uni'O: oor reopc ndi :\q tc^ a number of 
cutz-se^qio-nce reai'i^ns -'A oe.h^-i'ted :o-t of hcm'ologLU3 target 3eq'.:en':'e uni^s; 

(111 reo^mtd niro tvvri or mor-^ seqaenoe;:' ooiT;p .1: L.vi ng : 

fi! aligning tw.:. -or mc're nU'::ieio acids to i^^ientify regi-^ns of 
identify and regions ct i:n vers icy; 

([[) f^r'jvici ::a a ns-n- -■quiu/id.a.r set 'i-f sd i ';:c ru :d. e^v- i 
zo tW'j or mc'ie nus'.leic: a^vio sequenL-es at least a r-gin-n ot 

nurrr e i:' 'jf i e^:''' inrd :'.a:\t nU'rl-ric arij:!.:'; 

( h: ; ma;<lng 3 literary or 'i-n lit e r ap h^ st : ty iroviomc 
nc.". :d■o^:■^us cnime r af d. jst s *:a::h : : i:it: r 1 si ng a raaiKor 'or ^^rr 
seque:\':>;. sirr.ilar i ty ^nd a^ least a r^E-gito; f .^•^-<::■:s■r. 
s-dU'i'i ]'ig a liliirary f '::ha K.erapl a^o: ; an-l 

(I.?.; a litM^ry p'redu-red in (11). 

hoE - Thtr meth^ol is useful ^.t- jircauce s^ hiirdly tf shufflei- nucleic 
acids, to proauce r e I'ombiriant mt lec^ules with gieater n/; leculai diversities 
a no for generating classi<:al mutagens. 

ADVAMTAilE - Hf.-ii'cd oo'.-u:: nu'ti e i .Eicids with i 'jw soi^aence similarity or 



^: to r resp'jndlng 
divei'sity; and 



n tirjjei' ■: 1 
d i vfor s i •/ and 



jiTjj 1 n 



d',-- f rauia-rnted 



; r.-h'tm tl t '-^f tus nu 



1 



MC 



nt-n-nt'imt L'jos'Us natieic at^itis are e^;:sily r* 
cd igijnuc loot ice oe:rdved i r t^m ht'mt)iC'g.:.us 'tr nc 

r ecC'iTiblnat ion . 
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EPI: TOl-J 
TECH i;PTa: 2000'") 0 5 

TECHN'JLOGV F'OCflS - BIOTECHM'jLOGY - Preferred ti li gonuc 1 et- 1 1 de : 
(11) contains overlapping nucleotiaes ano enctoes an evolutionary 
intermediate nucleic acid. (II) further comprises a nt;mber of 
oligonucleotide units having ccnsensus regicn subs-quen'tes derived fr-tm a 
number of first round screened nucleic acids or derived from hcimtd'tgous 
target nucleic acids. (11) further comprises rmsdule shuffling 
oligt'nucleot ide units, eich compirising a first subseoiuence fr<.tm a first 
sequence module and a seoon-i subsequen'te fr-sm a second sequence moaule . 
(II) also comp'rises a number of c-tdon-var led <;d i gt^nu 1 1 eot ides and a number 
of oligonucleotide units comprising at least :.-li) units in approximately 
equimolar amounts r in n'jn-equimcd ar amc'unts. P.ec<: mt i nant mtcleic acid is 
selected from one or more desirel trait or prt'perty ind enct'des a full 
length protein. The set of overlapping (II) are t^.hi me r- af-l ast s . 



; e 1 c a-:: ids 
si i t a : Ing 

: 1 tie - air-'Cted 



&12-?;0 4F; 



A set of 



The library of o 1 igonuciet^t ide 



:omriri; 



•ast 10-:0 



:ir mt 



■:d igonucleot iae units in n^jn-equimo lar amounts and each have a sequente 
identical to set of h'jmcd ogous target sequences comp'r i sing at least one 
variant sut'setiuenc';.- . A tiumber of subsequent re :;i'tns ttmiprises 
non-C'Ver laf't)ing sequence regions t-n th- selet^el s-.t t^f ht^m^t :o.'gt>us target 
sequences. The ct^mp'OS i t i'tn comprising a Horary t-f : 1 igonuc 1 etd ides 
further comprises p.o L yme i~ ase , thermostable [)MA ^rcdyn. erase, n:i'-^.leic asid 
syntnesis re:igent, buffer, salt, tria^nesium, ^ir.l jne M' mjre nu::leit 3::ids 
comprising a number of hsnhoiogous tartTOt sequeri'te units. 

Preferred Method: fp) involves ebongEiting usin:.T polymerase -tr ligase. The 
metht-d furtner t.omrjrises denaturing the population of r^e ::timb i.nant nucleic 



a::itis, reannealir.g the denatured recomoined 



tils, extending 



li':jating the resulting re annealed nucleic acids ann optionally selecting 
one ar more recomtiined nu'-;deic acids fur desired pr:*perty. (R) further 
involves rec :.)ribini ng the reannealed retDmbinant nutleic acids. Numerous 
populations of recombine'l nucleic acids can re screened and determined for 
a desired property tc prt'vide first round screened nusleis a::ids by 
hybridizing a se::ond set jf (II), derived from first round screened 



nucleic acid.s and ^longatmq the -.^-oni :-;e*. ■?-:: di) t pr.---ide a 
^ -pulat i-: ri furtn-.-r rec "rnt.'in^-d r.i./ieic a :i ; : '-(^uenc-.' . Tr..^ ;ecjnd set of 
(l":) comf rio-s num-.-i '.'US o ! i : ')nac 1 -c— de s c 'mf ■ t: : o .^ng s ■ .tse I'lencea 
c.rr^^spondin.^ to a* Lease one reqi^.-n o: :.l l ve : . f:y . Th^^ co:nr .... : it i on 
C'lrr^pi isi::'^ at Least -..ne f ruarMerit'-;d na..:ie.: c acid and a p':)pal .iV i^jn '-if (11) 
i.:- p-rovL'.iei i.)y aLiiriinq n .n Ic. p> k-; nacL'.;::: a "i 1 ;;equeri ce.-^ sele^^t 
cor:.-jTtrved regions f S'-quen-e id-^n- ity -irid r^g : ^ns of i..-r.-.:e di-^rsity, 

cyn^Nesizin-^ a nurru.---:- ■■ i [ID - -;p.-n.-i ,: n^j ^o ;-:egi>-n -es -^^r.pr i s ing 

rv;gi.-..)ri3 ..-i: sequenc-; diver... ty, ra-v-im"^ a tilt leivith na-l-oc acid 
i d..a-i*: i-:::a 1 to cr no:-M d 0'_rcu.~ witri r^- cr.- .go-.s na:!-^^:: :iC^^i3f : i igmont ..ng trie 
i all. ten-:th nu::lei- K'V clerivag- v;ith dlTise -r Uy ijar'tial .::najai 

^_..;rr.;at:ij:i, and :Tiivin:j tn- reca_*..c.j .:er: :> : n:::.'-Lc ..i d ;i :: r ".. ;:!i-;-it wich a 
r.-.ntij-r ir Clj to :r:c/j';.ie a :onf-: - .-^ : '.on ;:)rpri3:c.g :d: t ::r;v:; nt-^ i nac-ic 
.-^.13 an'-: a oopula: ivti of 'ZD. on- motnid l:artiv.o:^ -.nrris-.c sele-ting at 
^'ru5t a ys^zono rul . ler.gtr. nuci'-.o. -ac i.d jn-.i de.i /lna :.t. : provide a 
ce'.:::;nd s^" t ot nucl'-ic aci J r ra jn-^ rii: o wn . :;n i .> n :_so oo-:-:! wi^ii pipuiation 

(ill. (r:) are added t- tne ■-.■.-irvs 1 1 i-'-n by a.._ign.ng ^•ji:iJioqo-j.s- nucleic 
,u--d seguen.-:eo, selectma at least one ::.nserved re'j^Dn of -egnence 
laontity and a nurrir.^t ^.l^-egions of sega^nce .diversity cornprisinj a nanber 
r-t domains and s ynr hes i ::i ng (II) C'lr resp-ndi rr:| t- aornains of sequence 
diversity. (II) causes doiaam swit-hing with tne fragmente^i nacieic acid 
m the regenerated rec'jmcmant na.'::-ic acid as c.jn",ra:-ed t^:- h-jmcd 'Og dus 
i:acleic acid seguen^-rS an-:i ere-:, les : ne or rx-r-- dom-ns of s-^pience 
diversity. Fragment, e nijrlei: aoi:i is co^Bined by ol-'Mvinj a c.onei 
nucleic arid c-r selecting a nucleic acii soJu-v::oe an i by syn thes.in mg 
oligo nucleotide fragments cc-rrespc ndmg to selected nuci^-i...: acrd 
c;^,-;,jen'-e . A numoer ol igonucl'i-.,- 1 .oe segaeri are synt^hes i c-j i serially or 
m ^:.arailel . Homol-.o^o as nu:!-:!: .::d seruen::es ire :aign-d vn a system 
:..:anpr:ising camputer v;ir:h siftwar-/ br-r S'-gu^nc- ili:in:'.?nt :c: r: y rnanual 
.uiianment. 0 Lig':)nu.;i-ot ice anit ore seiejt:e':i r^y alij:.ing a n:mher :f 
b.:m:l^:^cus t::ir:;et coga^;nce-, oev^r-onin^ r-^oions ci ^ientdciy :ind 
diversitv, synthesicmg cli g^: nucleor ides to er.:ode ar; least .i portion of 
region of identity, a region of oivorsity ^-r pjth tne regir^ns ana 
selecting them. Sequence diversity doniair.s correspcnds to aciacent 
sequence regions of homologC'US nucleic aci'ls when aligned. Producing 
chimeraolast library further in^.-rludes transducing a r-op-.u 1 a 1 1 iii of cells 
with the library and detecting the r ecombina t ic^n ot marker or ether 
regi'ons of similarity with one c^r more nucleic a^rids to identify tne 
recomcined homologous chimeraplast and thus t i^ientify active hc-rr.:dogc.us 
chimeraplast . Active homologous cnimeraplast s are j ecC'mbined t.i p.r.cduce a 
lir.rary' which are then transduced with a second population cf cells and 
identifying additional active h^ju-dogous chimerapdast s . 
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t: dovel methods fc^r s^-rlectmq target sites for, and pr .:^auct ion or, zinc 
finger proteins, useful i:>r (xntrcd.ling express! cirj target 
gorges, e.g. for inhibiting oricC'g-ri^s cr treat Lr:G siid:le cell anerda. 
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lOM :'12'.^ i: G06F017-50 

Wi;.^ 2C":-J-L;.:19 A UPAb: / 0 :"i J -j V) 5 

NljVELTV - :::el'-ctina a r,_,r j,z.- 3Lt-^ (TG) within a nu':.:.I-ic aci^i (I) to b'e 
tarqe*:^'! L-y a zinz t'iri-jK-i protein (ZFP) t^y 'ietectinq a specific 
1 ; -ba:^-e ir.'. r i f :A), is n^-w. 

:jr:TA:2ELt T'ESCF.] PiT'iM - Novel nv-th^l f : r sele-tiroj a taraer site (TS) 
v;i*:.:\ir. a :.uci*E'i.c a':'i.d {Ij "E'j ;:e :ar';ietoi h'y a zinc tinqer protein 
{7S?) b/ te :: I i r.a a --i^'/Liic 1 J-ba::-e iL'jIII 'J [ fo^ri^lrt (A). 
V bNzc- (Ai 



C t _ > - T I , . 



.^ch of 



b) an*! 



lo, r.; 



at i H- :j t I 



i r: : 1 uc--a for 



1 '— t o i 1 ' J w i ri : 



N - :;ny nuci e-^t io^r ; 
- G or W 

IMr'EF-EML'ENT CLAIM:? :-ro i I 
i i ■ ,^ol'z' -tin'; a TJ- to/: 

(i- o:lor:t i t yi jo: -n t i i I T.A: :o::m;:- r- 1 i ng ':nroe contigcjcs tripAetc; 
(LI J determining :1:'0C<:' re^ f':-r each c^: mr-inat ion c-f tripieoc and 

p 'O:' i t i c ns ;jy arp lyi.na a r r tE'.s]:o nc-nc'i^ r'ogiiTie; 
(iii; C':anhdn inc 5ahc:orec for. all :hiee triodetc tc. give a ;ECC're for 
i i:iot:enlial To; 

(ivj repeating ^he j:.r<:'Ced\:r.r f'Or ut ieaot one other potential T2 ; and 
(v; t.utp-atr ing at IcaOt I poz e:\z .la \ i';:; t'tg^Etn^r with itc ccC're; 
(o; f:-r':idu<:-ing ZEE ccrix-r icing c:': nc : r act i r-;; a d<:;taijace ct many Z?Es 



c-an::: 



rc.iti:aiiy ny tne indiviocal fmgerc ^f tno ZEE, and in th^: came 
coder 'o:- -i i tne ting-rs are ar!:3ngeo dd zo C' ii\ 

protein, : den : i f y ^r.':: finoerc tn.::: cio:;c i f i ':-a I 1 y nine; t'j ^-a rn 
tri-pd-rt, ana ■tut]:'a:ing ( tab i ■eic-.e i ';jn = : i c nc 'tf tn- r-r levant ZEE.:; 

( ■ -J :-t-mf:at--r }:'r t-':: r E^ni ft.r Orb:-'::ting i TE, ■:-oniro- o= i : 

C'i gO'ET: ::e , -oj 1 cu ba t i r.g a ;::t'ji - bjr tn*:' itt t ^--nt i 'I E f r-jin a 
.e :utsct're=. I'tr E tripietc ic in taj, reiteating the 
ca bra 1 at it-n steppe at Ie.a3t CTice, and prt'Viain':: an 
i:c s<rtre; 

(d) J S'^'Steni f"-r celectirji:^ a TE-, cimp'rici.ng a m'snitr;^', a ;-yctfE'm l:uc, a 
r.rt-cecct'r t'perativeiy 'Jicpt-seo tt' p^rovbEie t-r rO'teiv- a p^cdynacieooide 
3e'::uence, select ro-tential TE and cab:aiate a scc're for the TE ac in 



:ie bjr 
w 1 1 ^1 i n t h r; 
'TTLhd nat ion of 
e Leot i cri ana 
atijut C'f the TS with 



Ce) a ::ompator fi'rt'gram f'tr decigning a EE?, t^jmpr icing cede ft^r a 
oatat>ase ot :■ f i rivj^e re^'i EFrs with cutolec i gnat i 'tne f-jr eai?h finger cind a 
ct^rr ecpt'Haing nucleic aci. :i f'^r -:'ach ZEE', f-rc-viding a TE , icenti tying ZFPs 
in the database thai st:'et.d f i '.ra i Ly td nd t 't the TE and 'OJtputting the 
O'lcignBt i tT.s and cal'decignat it nc ':d the ZEE's; 

(fj a system ft^r selecting ei TE, tt^nif^ri s i no^ a meirutr;^*, a system kus, a 
tcO'Cesst'i t'p erat i V'E' 1 y oiscoseo tt fo'-tviie 'tr a oataria^e of ZEE 
d-E'signa t i t:n2 and tatt^ut ZEP oesi^nations and sar^des i gnat i t-ns as in (e). 

A'JidVlTE - Ant iii' iict e r i a 1 ; ant ivi ral ; cyt'Esrat: i z; neur or-rot eotant ; 
ar.t: i Eiriemi r . 

HECHAMIEM t)E A':^id':'b' - Mt-uu 1 Eit i E-n Ed gene expi re s s i cti ry the Ending of 



oEE - EebEo:tnE)n TE is use'El ^.o tiesign EEE' zr.^t birrA t ■: 
t^igets. EEE' r)in<Ei EEA an^El ?:^n mE'-EiuLate (i^trdrd: E^r 5i'E:tiva: 



r'r^E';E;'E' 1 e ot ed 
the 



e>:p»re'SS i tT. :t ii v;:i-Eie rE;n;je Ed" E^enes . TyplEcal :d m^ny r-EOenti5ii 
Eippl i'E:a: i'E'ns, -Ed ZEE t^r tne riUEd.ei'E' itiEis tnEit. enErE)Ei- tnem, Eire: 
i, r.h i rd. t i E^n z i bEiCt'E'rial E;r ',dr;:;l Erer.es, eo. t e^ ei^^oi-E' t E^r tne ^i^eE Ejer.e 
[ i mp)l i E:at ed in Al che iriie r ' s liseise); LnEiutinEi exrires s i on Ef fetal 
ti-'mE)gl-:ir)i n (for treatirig sit-:le E:e 1 1 anemia); Eifid in rjiatits i o iricrease 
r^'sistante tt' Eiiseases or ne rrc-td :ies , :E>r to increase ■Eiiei.c acid synthesiE 
at trie experise oi Lir^ole-i':: or 1 LrrEdeoi ic a tide. ZEE' E'an al.st' be us^ErEi 
dia gn'EiSt ical iy, e.g. tt' Eiete::t \^ardarit, di so ase- re 1 a ted alleles; tj 
quantify ■top-y numl)ers td' a gene; to EieteEEt pathEigenlt miE;robe3, anEi in 
a.nalysis ot ohenotcype ar.El function Ed ger.e exrires s i E.n . 



ADVAETAZE - ZrP can ne e: EOit r ti le Ei r^y 3ma_l mtlecuies, aiLt)V/ing t h-E 



adjustment of the deEjre- -of repression -E^r attivation pr:;duted by EE! 
in transgenic animals, makes sv;itE:hing zn a EE? at a late stage in 



ana, 



!j 9 /' 4 ^-^ 3 4 C) 1 



embryonic develC'pment pc^ssicie, -o that errects c-dii L^e L udi^-d ^ in the 
adult. Nucleic acid:: eri::oJir.q a hFP can be int r :c3c^::ed at any ^ ite 
(htmoioqous i-eccmbin i t ion is nnt .r-e:3nireaj and it.r^-anse ::FPn are 
trans-dominant, c^niy I ci. taom^.'S- ina I topy need be t fe. ent ( 
functional knt'Ckc-uto :an in. or-ducei v;.th-Mit b:iiok::r -si:iq). 

FS C P 1 

FA AB; J'ZW 

MC CPI: B04-N04; Bll-:":^; Bll-Ct'^; B12-K04A; 

B12-K04D; B12-K04E; B:4-Ai 1; LU4-;v:j; Pi4-F-:'; 

314-H0:B; B14-onA4; BI4-!Ilo; D')":--':!.:; P0^.-H04; PnV)-Hi;.P; L:.i5-H[;o; 
D05-H09; L 05-Hl^;-.o; Ph' -h:":m^ 

te:h ppTh: JOojh-rio 

TE'"Hd<")LOG^'' T-(-,cuc^ - PI'"iTE'':i{r]OL':)'.;V - E'i.-'.-::erre'i Taiojet Pites: in lA) at least 
;/preferabrv alx .1 .a ; , b- and i^-, ■ \ ar- to: and TS riay include 

twa (A^ preferably a-F..rat:ed ny at no:^t " tas^a-o P re rably , caie segment 
is 5'-\l6G-3' and tn^- -^tner is ' -rw^xatP/bUac- ■ ' , u:.th no ^nt^" ryonmg oases. 

may occur m a a^iim'T reqion or v;ithin, t r outsi.c-, a raiMU.i -at ony 
re-ji: )n . 

Pr'-ferred Process: Many difterent potentoa^ PSo .ure s-jlecte-a rr^.m a target 
gene. The ZFP of (A? has i tmgers, 1 spe^ir^c :.:.r e.ch ol hdc, a:Jy and 
r]tb-:. Preferably t^acr; fin;^er ts selest'-a an i ies.aned ..rtiepen lent ly trom a 
database of EFP thit -t ntams desagnat ions - f PFr, soP-Jesi.^n at t'-^ns or 
individual fingers and nucleic acul seguent:^s b^'und by the iinuers. 
Alternatively, a fLng-rt ls sele^:tea ny 3creenir:U var:.-.;nt3 oi a ..inc f^ngei 
binding protein for specific binding to To, thus luvntifymc a 
variant that binds to TS. In tne netnod : : 'u;, ..t :-.st 1 i-t ent i a TS, 
with tne highest s:tre, is the O'Jtpit and tr.e sunsc:res are -tmomed as 
rh.-ir products. The c-r respt nden .:e r-^-;! irre t'a:tpr:ses tripiefos and 192 
subs-ores. Sutscores ciie leterminea h y assi^nln;; d - : eren- :■ a : r es far a 
•^i:r)^-'t f trit-le:s jr:d C'j r r^cst canii ng tasitbans Mr on::.cn ^htr-;- is an 
existing ZFP with a fingei: specif i:: for the trioiet : rom th^ _ same _ position 
as in the cor resf .on Jence regime, a sur.set for \/nicn jii existing rin^^er 
specifically binds to the triplet from a different p asi t ion, ar.d a subset 
for whi'^h there is no known finger that specif icaliy binds. The 
sr-ecif ication includes a table showing the correspondence regime. The 
subs,-r.re of at least 1 triplet may be combined with a context paramerer 
^(CP) to give a scaled suoscore, preferably when the target site comprises 
th*- s-quence 5 ' -IIliGK- 3 ' , A ':'P may also be combined with the score for a 
potential target to give a scaled score, particularly where the potential 
contains motif (A), and different CP are used depending on how many (x, 
a), (y, b) and (z, c) are (G, K) are included. A similar prtcess to_(b) 
may be used to proouce a two- triplet target site ana a- finger IFf . once 
ZFP and nucleic acid seguences enccding them have r^een designed, the 
proteins can be produced by standard reccm.bmant DNA itethods. 
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T'" 3-D structure of st-rile alpha motif domain used as mooel far determining 
3-D structures of ao.'ii tional native or mutated CAM oamain with unknown 
structure and structures of co-crystals oi CAM domai:': with nodulators. 
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pRAi uj L ■r^^-ii:-i9r! r^3:^i.:iB 

:C I .:M C07K014-705 

I .:S A*.ilP:':)3&-17; AoIP0^5-00; C07K014-47; G06F017-50 

AB ) :M:.;-)3:' "jOO a UPAB: 2mO'Vj311 

N' i\^Ei;rY - A piirified t.M*;?-; ■iirnons i ■")ri3 1 structure of a polypeptide 
(1; (.'^uu-eofv-jri'-jing to 'jr.o ■:)r mor-? sr-^ril*? alph'i motif (SAM) ■Jomains . 

['ETA:L!^L> DESORIFTI'JM - I!il)EPEti::;£NT CLAIM;.? are also included for the 
f --1 lov; ma : 

'].) -A purified cr'y;f.niiiline for:n ^ ZV) if ■!); 

[.:) forn.mg OK f ( I) ; 
''\ a p:;. *.eri t i .1 L its^ialitar ;II ")f . function :)f S7iM 
d-Tiair, :d' --ri Epri : E ::y^ :': r::)::)0 l -'a n prjiu::_ng hu'nan hepatocellular carcinonias 
c-J -i- ine.! recept''ir : ^ient; i f usir. : Ob'; and 

M .; -.x peptide 'ill) whirh nedi r es OAM djmain 
function ::nipfi5e:: '/VOV, .3AV\'0\0 E."A'/V, FSAW.IV, FOAVVSVC^D, 

v';ov:.;i-WL, fi^tv, emt'/fe, F:JT7L>Ei;h, TOF:r:v:;E;vL, tsfntv, ytsfntv, rgev, 

ROEVLO, ROEVLi;Vlj, VPFR3EV or VF>T?SEVLGW -r naa a f.:)rmuLa I or II: 

^i; F-:-:i-x. -x3->r3 ->:t-:<o ; 

1 : : ) >: 7 ' :•: 8 - x - :-: i o - :•: 1 1 1 1 - :■: 1 3 - x i ^; - :•: i f - x i ^ . 

:■: = .:lefined in DEFIHITIOXO : F'uli ini^ ions f^ield 

AOTr/ITV - CytjStatic; iTTaiuno/'upp r^-a^- :. ve ; v alrier.:i ry ; n* 'uropi'ot ect i ve ; 
n-.-:.-t i--:-r:d c; anMp arki.nc :-ni an; cereor-::'pf Dt-rct i v-, ^n t i a ro. hrit i c:, 
arit 1 u: nei cisc 1 - r'^ 1 1 . 

!!E I'HA- II OM 'IF A'TIl-X - Ep:i . E ry a i-jp ..i ^-i 1 n producing hu:;ein 
hepatocellular car irior.as :ell lin-.-; receptor SJ\A :i-am:iin nodalators 
I I a ir:;e :i } . 

■'OE - The .structural :■:)■:■: .r.narea -jf a SAil :ioin~in :.)r CF of (I) can be 
us-ed ac a nic-ael f'jr de*: e r:riinii:-.i tn^.; vra^ee ^i:a"I^Jna lor.ril a ::i"uct: ares of 
polypeptides veiO: h ::>/al j .^ri-i Lua „cyca^:/n ru 't'..r-;i. Tr.-:^y i\r-: als:^ 

u---i r--r iden^ifyin'r (Ml; .1 p;*en::.;: rv:iu.at-::r a lAM d:)r.E.ln of a Eph 
f Eryt ni : C'-jie*:. in projdu-.:i n if nu.ir^n nep:^^. -^c-. 11 alar :^ ■:ir :n n jnias c-.ll line) 
re'i-epr.or function which i:"r/<j 1 vea -i^:.':: ei ^ a ■:: xapur ei 
rec'reaentatio^n ot a structure ■.vf a c-jn.p'jur.d witri the ^'.■;>:np)ut-i- 
repiresentation of a structure 'One cu- more oAM d'MT.ains of an Ep-h 
receptor. {11) identified by the ah-jve nethoil or a CF, can h^e used for 
treating .a dioedse dSSC'Ciated v;.i tn :.naj:>propr late activity of SAM domain ot 
an Ef'h receptor in cellular or-^'fini sn, such tn^it a CAM domain 
function 13 i'l'tivatea or mhit'itor t'O troBit h cell p^rC'l i f er ave 
disease such as cancer, angiogenesi s , ht ne ros ^leros i 3 , arthritis and 
diseases ass>jciat e':i with rne nerv'jus sys*:en ( jliimed). The knowledge of 
tne r.hree dimens icaial s^. ructure of i CA!'I d-.^niain, in particular ohe EphA4 
CAM domain enables us to identiry h ::mo J ^ues . The structure coordinates of 
a CAl^ dom.ain or CF 'r^f (I) can r^e used for identifying additional native or 
mutated SAM dt^mains with unh.nown structure, as v;ell as the structures of 
0'::-crystals SAM d'jmams with ccmp'Ounds such as mcMiul at ors . The 
sforuoture coordmares and mc-aels ot r-x CAM ■■iomain three dimensional can 
a^so be used to determine scd ut 2 on-base'U structures ot native or mutant 
SAM oomains. The siructural coordinates ot a SAM aomain structure may be 
appilied tC' nuclear magnetic resonance (I;MF; d.ata to determine the three 
aimens i'Jn.^^ 1 structures 'jf polypeptides. The raticTial cesign =ind 
i '"lerrt 1 f i c.:it i'jn ot mO'^iu Lat o-rs c^f SAJ: d-jmains -can t^e a i'cO'mrdisned by 
jtili.-.ing tne atO'mi'.: st r-i..-:tur a I CO'O rdinat es tl'ia: define SAM oiC'ii^ain's three 
dunensi'-'n.-j 1 struc*:ure. Th- new peptides (1:1.1 may t-e used t^. 
identify : eaa comp-junds f'-^r ilru^; deve I pii.ent . A comii^a r i s-en the 
s:ru;;^ure of peptides similar ir. s-guer-ce tu: liifforing in tne 
riolC'gica^ acti.viti-s triey elicit m target m«jlecales -can provide 
inf 0 rir.at io n ah'Out the st ruct ur e-a^:*-. i-'i t y r --^lo:'. io-nsh Lr' ( SAFC of the target. 
Inf 0 rrr at i':. n O'h'tained fro.'m the exair.i n = t i '-m O'f CA? can he used to design 
either mooifieii peptides ur rt'ner sm.all mo-le rules o-r Leao 
O''. mC'O unds wnicn can be "rested f <: r pr-ai'.:teo r^ro^oer t ies as related to the 
taro^et molecule. T.ne new peptides may be used to prep^are 
ar.t lb .:■ :lies . A:;t it'O';! ies and labe.ec .an^ irodies sfMacific fo^r Moe new 
peptides may Lie used to screen :or the proteins 

O'lnta^ning SAF-I domains. They may he used in the treatmLent ana diagnosis O'f 
disoraers assoiciated with ai: errant T cell signaling and tO' modulate 



tPlorrer.:- function. In parti-uiar, they may usfTui m iiieLr.ua.s 

f (- •■ triPtaC-v cellular senes-eru-; ar. i i:rLnorT:ali;:ar ion controlled by 

tr.'.-iir,.-^- J.'-njtn and t-slomer :is.;.- ac'::"i-:y and ;ele:t.ive imrnuno.suppressants 

'l;, ,-.r.3,.r, trarisniar/:ation) . Th-y r.'ay ..iJ^o b.. useful in th.;: t:reatnient 
ot '-an.^^r Tl-.. new tnr-ae dirn.-isir.n,. L :-'AM d-..n::in structure, r:ew 
peptides .r.l lu :.du . at . r r-.ay i-^ n-;.; 1 . ;■, nld the ^.lologica ;. 
^ r...., ,...-.^rvor Elm .ij,.ril .n .u cell and m particular 

lt-..'ral at ina 'a 'pathway in" a cell r.-r^La-ied by tne liqand or the recept:r, 
particular. V -nose c.thw.ya irivo:-/.:! :n n. arrna... d-ve bapr^ent , Hxcnai^ 
mi-;raMcn, p -i': h t mdi na .-ad re.jen.-r..:t.on . '! no new tnree drnv;n3.on.. o.-.A 
d ■",ain sti ac-are peptides "ind noraiatar.s identiried using tne 
m.:-th:is wi.l i;- uaelal la pn an-: .a .t : 1 ; -adii.-e -.;on ::-^;ane:5 ^ a , nerve 

irr,:.v:v-- :as ar.i r .;.aene rat i ■ r , ■ tr-;it c;:ndi-;aons such as 
n-'r:.de.iener,:ive dia.aaea ^nd :::naiti::n3 m-'oi-inj trauns and irnury to 

, ...j ,. .-■■..■torr , fee .^c-carLDle A j zi.r . :r.e r ' a ]i-;eaae, Par rcina-jna ' ::■ 
dV'e^ae/'Hantir.gten' .;:seaa..: :l-.:r,yel i a 3t m :r iiaeaaea 5acn as 
multiple -alerosi,:., amy-tropi: 1.'-t..1 s-er'-sia, iMcterii:. .nd 

Wrim^-:k-'o 'Jiseas- oivu nut; r i t; i-jnci 1 f>:.:.yi:e.j ropa t ny , f: roai ess j v-;^ ^ 
o,.r--iu-i*-3r pal?y, :^riy Draper's s yri.;.ir.-:.:f.e , n-ii t: .s t^rn. de-:Tenei a t ion an i 
oliv- p-r-r-: cerebellar .trophy, r eiipher 5ii nerve darr.age, trauma and 
-...-h--:n^ . f-su:rdna Iron: str:.:.ke. Tn- f h . : :n ioe r: i ■: :i 1 e. -ncdosi L i on rreiy ce used 
t,, 3t n a" '"r inhibit rveurcriai o--- . ■: [: m-nt , re^ jereei ri t_ on ,.nd axonal 

ir- :rari;n u5 s oiat e.l wird: n-ur :: d-va-n-.r at i ve :r-n t i -ns -nnd involvina 
ti'aan.a uioi invjry t.r^' nei-vous sys*. e;:;. 
Dwyi . 0/:- 
Fs cf: 

J'^!: "^r: oii_r.-i2- ^^:--'oo. :.o.i-Fn;; H4-H01; bm-joia::; 

B 1 - "f' ' ] A4 ; Pln-u in; Biai-Sid. 

TF:oh:!'''Ll'GV focus - ^-FGAMIC CHEMIJTTFY - l-r-;rcarati:n: :Tr.- OF o.: ( -l j :.5. 
lr>-m-d taxing a -oiurne of a ::aM -aornain with a roseivoar solution; and 
in.^ubatinq the mxture obtained m previcus step over the reservoir 
s.d.urion in a olcsed oont ainei and^-r oonditLons suitable for 
crystallization (olaimed). - u . x n 

Preferred Crystalline Forn: CF has dim-rnsions o: about a-b-//.14-F/ o.J 
angstroms, c-: 4 . af / -0 . O'j angstrom ani haS a protein ooordmate 
da'^a as given in the p^atent speo i £ i ca 1 1 on . 

F-^-f^-rred Method: (Ml) involves dookmg a oorrputer rep'oesent at ion o: a 
C'-.TP'-.und ^reir. a .-'T-outer database with computer representation of a 
s^ie.-t^d si^e on a three dimensional structure of a CAM donr.am of an Eph 
\F-/-hr--o.-.i-tin proiucinq human hepa t (ooel 1 ul ar caroinomas cell line) 
:;::^^,t.,^\:."rp^ determining a conformation ci the rcmplex with a ra^orable 
q^r,K.^tri.-^ fit and favoraole oomplomentary mt er act : tns and :;:hen_ 
id-n^iiymg compcunis tnat best fit the selected s.te as p)Otential 
rr.-.duCat -a-s .-.f SAM :iomain function. Alternately, the method 
iVvolves modifyrnq comouter represent at ic n ct a ccmp-ound compiexed with a 
...]H.-red sitr. on a three dimensional structure of a SAM domain of an Eph 
^-.-..-^^pror ■■■r <^F by oeleting or add.i no a chemi tal groups, determining a 
.-■n^.-,^ mat -an .-.f the -omplex with favtaa.ble g^'Ometii'-r fit an.'i tavorable 
.-.■mrd^o.-nfary'interattitns ana '.h^-n icaitirymg a ccmp.-und that best fits 
t-he^ -^d^-ted ^itp as ootentiai :to.du:i a t ors ol CAM tlor.am function 
:* The"m^^thod tan also^.e perform-J by seo.ctmg a c-mpater representation 
of a tonp:und fr-m a c-mputer ^at.base w-:n ttmp.utei : tpre s en t at i n d a 
c^^i-^'t^-d ^it- on a three dimensional strurture of a SAM domain of an Eph 
^■^Peptor r.r '.^F, ana th-n sear:h:ng tor m:de>-:ales m a :iatabase that are 
similar t the ctmitund using a computer searcnin: p roaram, tr replacing 
rtrt;cns -f the ctr^pound with s-til.r onemi:ai stru^^tures frtm a '..atdbase 
usirig a .otmpound cuilding comipuvn rroara:L. 

rreparatatn: 'ITTi is prepared ry s*.anoar:i stlLi phase peptide 
■■' ' ' n "J" n e s " ^""^ 

F>-el-rred 3D Strui'ture: The SD itru;,ture of SAM damain is of the SAM 
a-.main e! an Ec'n receptor preferably F.pha, compiexed with cne or mere 
..■■.mpounds and ccmprising ine :r ir.ore heavy rretal atoms. 



TECHNC-L("GY FC-CUo - BIOTECHNr^Lr GY - Preparation: (III) i:5 also prepare-l by 
standai'^: r ecc mt'lnant " eohni'^UL-s . 
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AN :M00-3h7S6C [ 33 1 WPI>: 

PNN njOOo-P',iCi 11: -1 I'^r. T'POO' ■ - 1 1'/'^ 8- 

TI IP'W onab polypeptide fr'/ra Y : r op^ojci.'Ccu.? pneumoniae, osetui, e.g. 

m v.^icoine-, for diaqrjjois -..f inie't^ons, and ror idenoifying 

ar.Oih a'.:-r..-.^ria 1 agents . 
DC B')4 Dlh TOl 

:K C;iAI3-'PE, A ?; HOLMES, L J; : rv--PA;:AM , L A; JAWOPGKI, P P; LE:;N0X, A L; MAY, 

E W; t-:AP;.;ULLA, M; PAY, J; WAMi3, M; WAr.PEN, R L 
PA {P[^i?') / [-IITHrPLINE PEEP'riAM Cijp ^ 
CYC 1 

PI Wm ^pC.:! --^eC':' AI PPOQi^PPo {.,iCA:-:3..^} * eh b3p AOINPj-P-IS 

P:W : AT BE CM C'{ P-E P EC EI EF- (-B C? IE LU M(3 NP; PT EE 
VJ: JP 

ALiT WO PpC-YAC:;PPa Al WC' 1 ^-US. O: - ■:P^ I'^^rilMl 
PFAI IJC l^^P- 19187 9 i;^9Slllo 
IC ICM AD] 1 10 37- 1:':; 

I P'3 AC ] :i 0 1 - 0 ; C 0 7 H C 1 - 0 2 ; iP C' 7 H 0 0 1 - : M ; C 0 7K0 0 1 - 0 0 ; 
C07K016-00; CloIPCI -pi ; ; o' : PPOI 1- G06F017-00 
AB W'.) ^pOPPcP^o a :.I?AB: PCPoOVlP 

NP'VELTY - I ^''.^ .E.iPed Z^n-^iB. polypeptide (A- that, io ar lea.?': 7Pv, 
I'lentical v;ith a •L': 0 residue ammo aci^i se-quenoe, tulips defined m the 
spe'::if 1- ^iti'in, -jvei th-^- entir^^ len'T*:h r f it, CMmpr ioo s , or is, the 450 
rT-oXLiu-.^ s^i'qa^n'j.e , l-i ^z.- f,^*^_^.'d^d L y a i *L-oomljiriant p'O Lynuoleot ide comLprising 
a I9jC t'ase ]:-'air seguenoe, fully def.Lned in one sp^eoi float i on . 

[lETAILEl'^ r,Rsopg PTIP'M - I CDEPEMPECT CLAIIC3 are also Lncl-jded for the 
f : L l.owvn y, 

(1; fH-;,^ ynuoleotido (Pi; tr.at 

\ u , ei'iO'ides the se^.^uer^ "e, a*. P'-as^ 70 5 ^denvloi.il the 4P0 ressLdue 
sequeri'Se ; 

(b) is at least 70 identical witn a se^-^uenoe enc:oding one 450 
residue sequen'Se; 

{o/ is at ieasL 70 linrntioal with the 195 3 base p'Sir sequence over 
the entire 301- 1651 nucleot.ide (nt; sediment -^f it; 

(d) enc.:.des the 4 50 residue s^^quence; 

(e) is the 301-1651 nt segment of the 1953 base pair sequence; 

(f) is isolated by screening a library, under stringent conditions, 
with the 1 953 base r^air sequence, or a fragment c-f Lt ; 

(g) encodes a mature polypeptide expressed by the L'naB gene 
0)f 3t repjt'i'C'jccus pineumoniae; or 

is a compl emen t of ( a ) - 1' g ) ; 
(3) antibC'dy (Ab; irrLmuriospec i f ic for (I;; 

{3; diagnosis or prognosis ■: f disease, t^r suscef-tibil i t y , relabel to 
expression or activity of (I), co^mp^r is ing determdning the piresence or 
abserice of a mutation in the nucleo'tide segu^n'..:e en'.KHii.r.g the 
polypeptide in the genome 'tf the in"Jividual, and 
analyzing f'jr the p)resen::e or amC'Unt of the polypeptide 
expression in a sample derived from the iniividual; 

(4; s:reening methi-ds f-jr i-ientifying c-:'mptunds (PO triac activate or 
inhibit function of. (I), c-tmp r is i n -j : 

(a) measuring th- rinding c-f a candidate ^ixupoani to the 
polypeptide , o r t j ce 1. 1 s o r memX' r axes Lie a r i n :j t he 
polypeptide or a fusitTi protein of Lt, using a Label 
directly or indirectly associate! v;ith the can-:iidate compound; 

(b; me.asuring the r.dnding of a candidate comp-iuni to the 
polypeptide or to tne ct-lls -ir m--mbranes oearing the 
polypeptide ■:)r a fusi'-in protein jf it, in *Aio piesence 
of a labelei competit'jr ; 

(c; testing if tne saniiiat- cDmpouno results in a signal generated 
by activatijn or inhioitiion t-f t.ne polypeptide; 

[i] mixing a candidate compiund with a solution containing (I), 
measuring the activity of the polypeptide, and c::)mparinq it to a 



stanciard; 

f^) 'ietectinq the effe^:t ot a ::aniJidate ■'ompo'ir.M on tho r (;d'.:'::t. 
rriRNA *^;:.cO':l Lrig t^lL4 polypeptide, arid *:he polypeptide ii: 
ceils, asinq e.q. enzyme Lirike:: iiTiiiianaaorbarit a^^^say (EL.I^'A); or 
: d c ■ J n t a c t i ri q a c jmp C' s 1 1 i c- n c ■ j n\p r i s i n ' j t n e po lypep t i de 



with t 
i nt e ra'j" 
.;i dereo^: 



^■anLpourid to b'l 
L jn ijo iriq ass^ o 
abie siqnal in 



' screen'i-a t -j a.ises:: the i. : itercicr ion , the 

;iat':;-'a v;i^h i b-'OO'n':: ':'oi\[pr):iOiit capable of" p)rovi;iina 



iijn ol zh'':. 



occurs ; 

:::ivity of (i; ; 
no.t, cell; 



d") residue 
:^a;red o:\ tdie 



FS 
FA 
MC 



TECH 



polypeptide and cornf ■ :.nn':w an-i ii- 1- rifd r. Ln-a it inreracti> 

; ./ -:g'jnl5tc ctn^i an: aacn i a t s :d the '-x|:i rec i i 'jn or 

f 'd exr-.'r ecc i.c n .Fyatein C'jiiL]:'r i a i :":C {II), C'rea^rr.t in 

f ■/ . h-aat ceil, i a iT.^-:T.tM an-=- , ^ ha^ ^:LC.tainc t::e 

..xprec. -a : I) ; 

proa.i-ticn :■ : r.y cji:.rin';j a^zll. ci (7); 

■:■ p;r ^'ca icn M -^^Eib: :d (/), ^-r ita rrierrb:.:ana-a , hy t rana f ^ .rrrat ion 
or' a r.";na :e :t ia^n; 

^ ! 'd ■cjrr'jjat r - r -r :;id i-E- i:i'r :m air. C'„':'iC a i n ma ai: b^.^tot t: 
:5eqaen'.-^? and-'^-r the b/5: baa-;; f^air e':|uen':'e ; 

■ill a'jrrp-ut er-t uced iaeah::i f n^Eni'id ^ay i den*: i f iad L i 
1 9: ? :j,jae f^aii' aequence; 

fid) :^'jmputer-k aced metha :i o^ rc i ynac iec't i a*:- a^oeitibiy ^'aaed or. 
loant i r icat ii::.n a^f an ^-verla]:^ h^etwe-^n me n'-.;. l:aae p=i.ir a^oqaence, and a 

':;■} :::-:dynacde.: *: .d-:-a of f:r:i.ala ' I I 3= ! :■: - ( ?.i ^ n -R. - ( ?c i m- Y (Hal . 
:\ ,ind Y ■-- ra/crcai-n, n.e^ , iTC''dfi--d nc':' i ec 1 1 do c r t:.aatner teem 
■-C'va i^^^t o Z'Tji: 

-racTi Pd ana Pd - op^Ao-ioAiw rTCicifi-c nucleotide; 
n. ,.ind n = o-:\o'j; 

■■■.a = : t't i C'na 1 1 y i f i ei 1 r*: : ;:3a-:' rail' seqaena*?. 

A''^ T I 1 T i' - .dr. : i. i: ' ..i ^ e j: i a i . 

ilFi'HAiJIdH 'JF AC I b jM - i nn i 1: i *: i : n :t bnaB, r^'joaiily a r-t'lijative 
:\-rl i CO; a^:- , v;ni ::n ia '5 ei\: i a I fd i jr.:->.aa-. :.:\d/':r iarrr/ai : . p^neun,on iae . 

St repf: C'Coccua rin^z-umcTi i ae , ia uaea t a':'reen fcir ac-radfic aq'::niats and 
.antaq'-c.i sta, po'-zential t herafi'eut i c iqenta, to raise apecifi':: antit'C-dies 
(Ahd / II. var':-ines, and in rational dr.^'j aeaiqn. Ah- are useful as 
diaqn-- s^ i :: i rwnun^ja a a 3 y r--^a a^-pt on':i 3s t ne rap'OUt i :• antagcnii t a . NU'jleic 
acids (AD tnat enc^:'d- (I), <~.i frciqmenta, aie used t<jr recC'iTX-inant 
production oi (i), and as prc't-es and i:iri:T;ers to istiiate hr.m'::d ogous 
iall-:,^nqth :::r qen^-nd : cbanes, iia ;jn'ta i;r , proanosis, ataqinq and 

tyC'inj feet ions, inducing detesticr. of qenC'H.ic mutati'tns, and fc-r 
chi -jnf^S'-.'me i dent i f i cat i'l-n or Tnappdnr. fll) ::in aiso tie used in qeneti:: 
ininaani cat ion, and as antisense innitdt-jrs. The therapeutic c:?ent3 nave 
nacteriL'Stat ic/b.icter icioai activity ind are used t treat or pirevent 
infect i'j'nS; especially th-ase reused r^y C. pneumoniae, tut aiso 
Kel i tt'.j :ter pjylttri infections and issjciat-i-a diaiucers, alsj fc-r 
treati'ient at in-dweilin'^ devirea and v.^junds tt- prev'=-nt t^acteriai adnesion 
L'wq . C' / 0 
■:pi EPI 

AB; Ci''td 

fPl : bh.;-C01C; B04-EaaF; BCt4-Ea:; Bt^-FOo; Bl4-E'Jd; Bj4-E'iOB4; B04-C01; 

B 'J - M 0 3 AO E ; 3 1 1 - 1: 0 7 A ; B 1 1 - C Cw A 4 ; B 1 1 - C a c E t ; B 1 2 - KO 4 A ; 
B12-K04E; B12-K04F; &14-Aal; B14-AaiB2; B14-S11B; 

:)'"t:--::i2; D05-H09; bat-Hll; Bm-rddA; ja: -ril4; JOa-H17A6 
EPI: T-;l-T 

dPT>d B0a:jC'7 1 a 



i t r-^'a^.ni'Eu.t : Wnen in::reiaei 
an o qord. s t at f I } or a 



IE :dlM-j:.'aCY Ft)CdC - BI-)L0C;Y - Pi-^:ferre: 
express i an/a'::t ivity oi il) ia require; 
fl)-entjiinq nucleis ^ i :i i.s :: :imini s te re d, o.voi wiien inr.ibititn is 
reqaired, an antaq-:»niat jf {I}, nuciei^ a::i:l that, inhitdts (i; expression 
or a polypeptide ti.at stmr)etes with -I) for liaani, aubstrat-': or 
recep)tor is a laiinis te re :i . 

Preferred diaqnasis: This involves ietectin:; a mutation in tne 
genomic sequence enctdinq (1} or analysis for presence ar anojunt 
:'f tl • . 



TECHN'^'L D'^Y F'^rUS - EIOTECHM :LDGY - P repar a^" i ci\ : ''I: i col^ t ^-"^cl f^'o^L 

St rept o :: j::cu J pneum:)niae .U ^099:^ (NCIMP A')'301) by standard ruethods of 
^::ion:.::-;j .md a-reenir.g fm rrv ira details.;. -Jnc^B isolated, (I) oan be 
e:-:pre;--sed In any conv^erit :. :)n..il vei-t Dr /h :. -st system, optionally as a fusion 
protein . 
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AN 2-'iO:V,:.3 / jt ^ ! b;-; j vJPi:-: 

DNN N^O'MJ- J'.^a:o3 DNO 00 ) 0 a - 1 1 ^'S 8 

'11 N-w pseai.loaro.d^ne .r-yn^h^is- I'roni St r^eptocoocus pneumoniae, useful, e.g. in 
viooines, for viia :jro:'S i s mfeooo jns, and for identifying antibacterial 
a "J e n s . 

00 R -1 [)■;.) TOl 

FA (.dlldO OM:TH?:LTr]E RFEOHAII 0^ 'RP 

CYC 1 

PI W''.) ..(■( o 0. 1 ^ Al P''-ea^,.- ; , ■ ) v) a '-O * EN 0)p A01N037-1;3 

RW: AT BE 34 CY TvE OK 00 FI FR '.IB Or IE IT LU MO NL PT SE 
V/: ..TP 

ADT Wl) ..00:m-)-";:^;1m Al VJl) 1 9 99- 00 J hi ?0; 1^991:09 

pRAi UP 19^^^--: ?as:4 ^;-n-o II 

10 I'"M 70j lUO ?.7 - r-: 

i:'S AolNOvl- ):;.; '00:d{.n..I-.OO 017JiO:i-ji; C07K001-00; 
C07K016-00; 01109:: t-tt; i: IP PIP 1 -Ob; G06F017-00 
AB W'.' P0C'P^y:rl9 A OP/O' : OtTiOiT:^ 

Ni 'VEIiTY - [sv'Iate'i polypepta.de Oi ) frcai S*:rept-acoc ::us [pneumoniae 
IS a p seu d':-ur Idine syn*:nase. 

PETAILEP rPSOPIPTI'lN - IsoIc.t-:.-i polypeptide (I) is: 

Oa} a*. .e::.st 79, i'-iP, ---.n- ^^50 i.dent^cal -I'ver the entire length, to 
d-taneu amm-j .loid ^aa; s-quer.ce (11) ^f P9P aa, aiven in the 
srjeoi f loa: : c^n; 

' b ; a n i s 1 a t e-:! polypeptide c jmp r i s i n :j 'II) ; 

' , ^ / r . _ 
1 - / \ J. ^ / , J J. 

(d) is encoded by a re-r-jmrvinant pic lynu ale :/t ide comprising a 1061 bp 
sequence (111) given in the sp'ec: f i cat ion . 

INPEPENPENT CLAIMS are alsc- included for the follcwing: 

t T \ - ^ , ^ ^ ] ^ A- -A ^ . 1 , ,^ . , 1 .. _ 4 - /TIM . .V i .-.v. . 

\ J- I zii ^ L L a L -T'JL K. ■.' ± J i L'u-.- ^'1'-.' ^ A. '^-r \ 1 V y ^Vi i i j. l . 

(a) is an isolated j:>ol ynucl eot ide encoding a polypeptide at 
l^='ast 70, ^0, 90 or 90^ identica. .aver the entire length to (11); 

(ri) C'jiiiTL'r i ses at least 7(:!, :-:0, 90 or 9^o, identity over the entire 
Trngth to a ii'Olynucleot ide encoaing (H); 

O-O' S'^mi:. rises a nucle-itide seauence v;itn at least 70, 8 Lb 90 or 95 ^ 
iaentity c-ver the entire iengtn of nucleotioes :^7-37P of :ill); 

(d; comip>rises a nucleotide seq'jervZ'^ encoding (II); 

(e? is nucle'i^t ides 97-';-^7j'. of (III;; 

[I] is 'jbtained t^y screening a library, under stringent ccinditions, 
w:th (III) c.r a fragment of (III); 

(gi ej";C'jce:f a matui- polypeptide expies.:ed by the Yfil 
([■ seud^jur i'iirie syntnase) 'lene sf St repts C'jccuc rcieuinC'riiae ; o:r 
(h) is cc^nrlemerit -jf (^}-(g); 
t .'0 aiit it.t'idy (AtO' inLmuri'i'Sf'ecd f 1-: nor (I); 

{ S ; IT;- th 'J ids of tieatin<j '.M-n'di t i':: ns r^quii'lng increased (jr j-educed 
activity 01 •■rxC'iessitTi oi (I); 

( ■] ) dia..jriosi3 c.r pr^i-unr-s is of disease {or sus'::e|:'t ib i 11 1 y ) related to 
express I c.n cu- activity cd" {D ; 

s r^-^eninc: m-^ttiC":!.- o.,r- i .::ent i f y ina rajmpc'uncjs (A) thst activate t-r 
inhibit function -if [I); 

aqccK.sts and antagr-nists of the expression cr at:tivity of (I); 

fOO. exp'sessitn system '-<tmprisLng (IV., p'resent in a host cell; 

(■■:; h'.-s". cell, t-r i:s mei(iti::ne, that ctntains the system cO (7) an^l 
expresses ; I : ; 

['?'; prt'ducticn of (]) by culturing cells of [S] ; 

flO) rjiijducoioin (^'i' ■■ells cO (3), C'r its membranes, by t rans f ormiat it-n 
or transfection with ( lYp ; 

(II) a host cell pr-tduce-d Py the method of (10); 

(1^) cc^mput er-readable medium ccntainino at least sequences (II) 



and/ or (III) ; 

(13) C'Jmp*uter-l:^a 3e':.i meth'jd if homolo'-jy i '^''^nt i f i ■■-cit i on , hioed 
s e ^ u 0 n o e (III); 

fl4) i::':T[ip>uter-J: ased meth'jd pcd. yn jcI.ocj ci de ciss-inbly b..i-:ed or. 
1 den": i f ioat i C'H O' f: an "^v-riap i:-etween (III) and a secorid nuclei':: acid 
oequon:o; ami 

(II) P':d yna ::lec.t ides ol to^iuiula (Ila). 

X- (RI iui-P.2- (R:^)m-Y (Ila) 

;< ^riLi 'i' ^ hV'iio::-;"!, iriot^^jl, modi flea r.uoleotidc: or t^jqetlLer form d 
■:'jvalanr j:Hjnd; 

oa :n Rl anii P.: - ^-pt i '^na 1 1 y iriodifiod ro_. j I ot olIo ; 

m an':i n = 0- GO ; 

R.o ^ r oo ic'n :'7 - ? :■ cd ( 1 j . 

A'::TIVrrY - Ar-^ ih^.o-^ilal. 

N'j Dicd«jgic:al aara io <:;io;en. 

^: S'^ nt 1 i 1 f'M- q.r''jvrh jnd^cr sai'vival ot pneojiaaii-^e . 

- yoqaeno- (II), obe p'r-jriaot c- f one Yfil (piseuac ar idine^ synthase) 
aene of S : rep't ooc 0':'as r n':-ai:b: n.i ae , is -^sed. to saieeri for st'eoiiio aO'Onists 
-und ant i-g^: ni sts ( p-j": ertt i 3i t h^Ma|jeat i aaents), to- raise spool tie 
an"^ i h ^-dies (Ah), Ir. v::0-::ines and In rational dr^^^o des.l-gn. Aij are useful as 
^liaqn-i'Stio i rranun'i-a.i-say r-eagenrs ^nd as onerapeut i ? ant aac n 1 .or s . Nucleic 
.o:los (III I ^na" or.oC'ie (I), <: r f raam-jno.: , ^re as^d f^L^r e'^LniL^inariO 

■ :■<:.; a-: '?:d.^ n ■: f (Ij, ^s p r-di^e.:- an^a primers tc isoL^te horn : I C'QL^as fall-lenqth 
':r a^:-n':'mi^: i-l'i'nes, f':i diaanc.si^, p^ rcoinosis , staalna ana typing inf-otions 
( 1 ncl a'Ul n^:: aetoctl';n :f gt:-ncnd : irr^tat Ic^ns j , antJ iljr ciir-ziLOsr^me 
I dent 1 f ioat iijn or n.ap'pdna, in 'genetic iroiToin i zat 1 c-n; and as antisenso 
Innihdtors. The t h*;- c ap-vjt i ac-n^s na';e r act eric st a: ic /r: act r i cidal 
aotlvity -ajI are cre':; trea^ ■: r prc/cnt mtectiC'ns, e.5i:eoially thc'se 
caa:eed o. p nc:um^.'nl jO , ii'-^r ^Li.-j -e i i LODr^.i-.-t e r podor'a m i: e^:.:"^ LC>ns an-i 
■ s s<jo : .i : O'l '"i i sO' rc^r r s , a 1 SC' tcr *: r e 5 tii.-nt ij.t in- 'ivar 1 1 1 n cevioo.z -inu vjo\i:ids 
'o: p'r event a ::t o : 1 .;il udh-rob.'n. 
two . / 

FS Ct'l ERI 

FA AB; D'lN 

MC oRI : R04-Rfj}E; B-j4-Fu-; R J 4 - FL 1 lOE ; R04-';;C::; 304-L05; Bll-C:o3; 
B12-K04A4; B12-K04F; RM-A^IA; BlI-AOlBl; 

D05-H09; L'05-HI 1 ; [jo -H 1 sA; JiJo-HllE; idJt-Hl-] ; D05-H17A:i 
FRI: TOl-d 
TECH UETR: ^00007 

TECHK'IL'JOY FOCdo - BRjLfJo-Y - E referred Treatment: Wnen increased 
expressic^n./act ivity -li (I) is required, an agonist of (I) or' a 
( I ) -encC'U i ng nucleic a :d d is administ ere'?., and when inhikdtnDn is 
required, an ant ag-jni st (I), nucieic S;<:id that inhibits (I) exp^ression 

ox a polypeptide tnat cornpetes v; i t n (I) tcr liqand, substrate cr 
re:ep'tor is administered. 

Rreierred Diagnosis: This Lr.voLves detecting a ru\itation in the 
genomic seq*.:ence enrc'ding (I) jr anal^'sis ii^r p)r'tsence or aiuoani 

J f f r ) . 

Rr-ef errea S careening Metniid: Metr^.jd cc-nip'r i es : 

(i) meiisurinq binding of ^est :^':'mp-jand (TC) tj (I), s^r cells jr 
rTiembraries carryiricf (I), -i^r its tusi-jn protein, f rcnn a I^bel 

t i n ) ':i i r-e rt ly ass jc 1.5; t - :i with To; 

(ii) as (i) iiiut in presen::e oi lir-ele:! c-i'mpet it 'j r ; 

(iii) -iet ermining if TC pro)':lu::es a si-iin'^l generate'i by a:uivati:)n or 
inhird titn of" ( I ) ; 

fiv) incuc-ating Tt: v;irn tl), "nen measurinu activity of fl); 

(v) determining : he effe:^t Tl^ jn proiu::td.on (I) or mRttA that encodes 
( I) ; 

(vi) in::uh>ati.ng ^I) arid T7 :;nd ^let e rniin in ^' any interaction, from a second 
t'l-mpsnent tha^ prod,;t-3 a de te ::: at; le signal in r-jsponse to this 

i n t e r a t i :) n . 

TEtHI/CROGY FCtdS - B lOTF IHNOLC Y - E^repara t i on : ^III) is isolated from 
3t rept DC-occus pneumoniae 01-70991 (NUMB 4 7800) oy standard m.ethods of 
::ioninq and s::reening (no m.ore details). 'Once isjlated, (III) can be 



protein . 
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DNN i].:'jOO-:i^4'V/4 [--[ic c.::0(ii.i-i''"ro:; 

T: proteins .■mf.'r : i n^T je ;i r ;::h : ri 7 j 30 conirirer database 

I'-jr t rairirrv;':.rk proteins w:.i-r. -:.m tri-^r. be m^j-if^ed, u. etui \cr 
■ A.r: deoi'in and eng i iie- i L- iq 'jt cj-. ii' iii e t j ■-^•s . 

DC B'' ■] ri] n TOl 

IN ANDP.EW3, P R; DOOLEY, M .'; YMVYiiE, [■] 
PA -'-"rA'-/) D!'!'/ C'YEEN^" PA' P' 

CYC 

P^ A3 J'.'^ ■/ -1 Al P.A-ji-^P'.' .^CP^A..^)^ EN 7 :A: 'P(' ; K" I -1 - 6 I 

PVA AT BE ''"H CY [p EA. E.:' PI EP" ''-P AH 'jR CP. TP. !T P'ii. P;-- LU [jC MW NL 

"A l-'T 3D SE PL PL TP YP; PW 
W: AE AP AM AT AH AP BA BB PC BR PY CA :H (3N Cr CY CZ [E DPC LM EE EG 
EI GB i3D GE CPH GM flF HY ID IP IT; IS JF RE KG KF R'F. RZ LC PK LR LS 
LT PP^ LV KA K'J MG I^K R!J [^]W MX ]\0 NZ PR RT P(3 PR ?D PE SG PI SK SL 
TJ TM TP TT TP i^A \}G \j3 nz VP' TP ZA ZW 
AY .p3iAY:iI 1.^11 A 2::'00r;:.:.iy (P00(P:7) CCiPRIia-Pl - -- 

APT VK: P0P':")p3.R?4 AP VVj ^^-AV^ '"'1 ^; l-PPIOP]/ AY PPPOOll-Al A AY 2000-114 1 1 
1 .-^ ^IL 3.1 

ECiT AY 3:Oi;'.i';11411 A Ba^eJ en VJc pip.K.PP'RM 
PRAI A'.- 1 POp-CGOP 1 'PP: P." ; 

IC la.] C07K014-61 

I':S C07K014-535; C07K014-54; G06F017-30 

A.E v'P.- .'YOCa j47 4 a uPAi' ; PYPC'P*.. 17 

N^'\T.LTY - A riethocl I'M" protein eng:^r.eor inq (j(:::nc ris :.no searcTiinq 

3L c.-'i:,f uter datal-a:^^;' :or rrarev;ork proteins witr. cAsi.red 
.r^r 2"ijct aT J:ea Purer , :: e ri^Mv. 

DETAILED DEPiZRI PT I <d] - INL'ERENC'ENT CRAi:jJ are mcludea for the 
t ; A ] ow 1 no : 

(1) a niethO'T c-f protein encu ne-:rr : ng coripr i sinq : 

■a) creating a cC'inputer 'liatabase with a nurabe}~ of entries, each entry 
::-::.rresp»onding to a aescr opt ion c-f a location and or i en: ati-on in } D space 
■i'^ ride ohains oi c^r:iir:C- acid leiiadues of a f r-LirnewcMd: proteins- 
lb) creating a query cc^r resf'Onding tc- a .iescr ipt i jn of a location and 
'I'rientat ic.n in 3D space of the respective side chain? of at least 2 amino 
acid residues of a sample protein; and 

(c) searching tne -.iatah.ase with the query to identify hits, where at 
Ivast 1 C'f the hits corresponds to a respect i*.'e framework protein 
wRi ch has structural simiParity t'D the sample protein; 

(2) a mC'dified framework protein produced by tne method of 
(1) whicn is a cytokine mimetic; 

(3) an engineered protein cc^mprising at most 70 amino acid 
r-'^sidues and 2-11 -.iisulfide bonds c-f a frame w.^rk protein, 
t'Ogetner with at least 2 aminC' acid le si dues an'jtner protein 
wnlcti die n'jn-cont gu'jus in priniary sequence; 

(4) an engineered protein naving an amin^o acic sequence 
selected from GGYOl, SCYOl, S'ZYOO, EREOl, EFT'd.A ERPO-: and VIBOl; 

(P) a computer pr^qriim for searcrAng a protein o-^irah^ase 
wnlch cc-mprises a nuriibei" of entries, each entry corresponding to a 
iiistance matrix rep-resent at i Dn of at least 3. -3 alpPa -C beta vectors, 
C-: mt: rising : 

(.n) comparing a ^u-rry witn eacn zlstabase entry; ind 

(r:) identifying hits by clique detectior., where a hit is defined 

ai'ccrdmg to a minimuir number of C alrha -C b-'.-ta vect'->r matches between 

trie query and each er^try; and 

fp) a computer prc-_-:ram which filters the hits i::entified at step (b) 

off:). 

RYE - The metho.i is useful for identifyiriq proteins 
suitatde for protein engir.ee ring, j: art 1 ::ularl y fc-r tPe design 
and en^gineering of ::yt:^':ine mime tics. 

ADVANTAGE - The method is advantageous sin::e a rrjdified framework 
protein may display one tr more desired character i s t i ::s , su^h as a 



function jiKiiiai t..-- '-.r ir.hii"-! t jrv^ oi t.h^: ^^lupl^s protein. 
Ff^ 'I'l E?I 

M''.: ''.'PI: B'")4-E08; B' 1-3 -liO JT'; BCi-3 - HO.;, G ; B':i4-HD4A; BO-l-JO^w* BO-l-Nl."!; Bll-COB; 
B12-K04E; D05-H09 

BBI: ?0l-J::i5B 

TECH 'ipt:-:: :l-~)0 ")Oi:>\i 

i BCHll'H./ )3Y F^^'jCU.-' - B> j ^'TE'"H! j 'Lvm -ri' - Preferi'-d i4*ith-j'i: Trie 1: r'. anevyor k 
protein ici'-rit L f i.-e 1 u.- u hi.^: h;^;; ^ gr-^ater i:t.a;jiiity th^ri t:-; 
;--.a:npi'r protein. Th- n-e:h'j:l lui-.n-r rv'-impr Ls^c lu-.-^cii f ying an aMiri'j 
: ' i_ i < ; ra>:!r.c:':- I" 1' r- r protexn '-/rii i r l-j:. :io L: j -.ii 

::.t, r:>7 ;:;\i}:*stLr i^^. '. ''^■■\S-t : -im::'.'"! a':id : du^^- v;itri at l.e^ct 1 amino 

.■\-.:L:i re^^:idi..e t tn^v .-'arnn.e protein t '"i oi'ira^e nujdifiei 
f smev/'Or k protein. At l- a^*. tv/'j of ^ r.e aii^in'!' a i resiiiaes jf 
tr.e sarrii:-.^;; protein wlii'i-:: sni.'S t i : ut--^ .-jninij acid re.roiues of ^ h-e 
: r aruevv'jr >: protein aro r.-" n-'--: i a^jii- in prdir.ary o-;-quon::e. Tke 
r(,-:.""Ji f i e'.'i franiOw<jr[: protein has irioreased i-tiU'^tural s liiiI Ian ty 
t-?. the .-"^-iniyrde protein. The rri-"-:! i r i o-i : rr:iniev;'j r k protein 
IS cap^able 'i-f e:-:h ibi t in^-j a function v/hioh is e .t her sirni Lar to, 
^.■r inhik'it'L'ry of, a function ol tn- .oanipdo protein, 

Tho Ao.Oiti.'n an^.i ■j:: i ent i^::-:. :d ea .;h andnc^ acia si vie- :hain of ohe 
r Lanew'Mo: protein .."^nd SuiTird.:; protein i s s i inrd i r as 

a f;:-ai}jha-o-h'-t .u vh-:-:.:::-. Thi- '> .i ■ ii'lia-C-bet: a ve'.O'js i.s ui rov: f^arm of a 

oiannoo'- luiLr.ix s-] - : -:.r *:rn": ■ 'j:. . A: .O' ep (oj :n- hi*:; uro liok-Li aocording 

ta struotural siinifarity wion the sampio protein. ^*o.ar:hing at 

.~tef:> includes i -rienr i f i oat hjn id' ^:he hits Yy; ':d oa^.e det-^-ition and 

rilTeiin^ii *-he hits i '.i>:-rd. i f ^ -o af. ;:.t.ep {■^). Th^ f oJ'i'i^wj i k protein 

:S a sni:jsi oystoirie-id :h protein oajoahde int err...; 1. tiisoffdde 

i f na^ i<an . The '-ys^ v'- ir:e-:o '^n protein L't-rLprises a* 

ri:,st /■■■ .::TT{:n!':' .-^-ds, r:a''_ng o^s\:lfdae hooios . Tt;-:- sairipl" 

pr o te i n is a 't ^/ 1 o k. m e . 

Prererro'.i Protein: The -^ngmo^i'-i protein exn it-its a 

function either sirrdJar t't, or irinik-itijiy of a oyt 'tkdne. The 

cytt'kAne is selected f rcrri grc'vr.h horn-ane (GH) , i nt eid. euo. in (IIi)-4, IL-6 

and igranu iooyte-'t'i.ilt'riy st imui at : ng fact'tr (i^-CoF) . 

Preferred '■is-mpiut e r Piograia: The O'jmp'Uter Ti'rograni <af (a) is VECTFJX 
prograni, preferabiy P'sSTVEC program. 
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-4c:E;:.6;: [41]; 10^0-491075 [41]; f Oih)- 0 1 4 007 [4 1] 
ij- 1^0714 L'UC C2ij0iJ-L':t^i.)t.; 

TI Hovel n.ethods for recombining os don-ait ered libraries oi n'loleic acids 
usoi.i rr. fir'j'duce nev; proteins an-'i new ve'^t-irs with i educed rates 

0 t r e ver s i t-n t o \vr 1 d t yp^e . 
PC Bi)4 COO PIO jr;4 TOl 

Hi LIT, L; FATTEII, P A; STEMMEF'., W P C 
EA ([4AXV-K) MAXYGEII IhC 
CYC 8 7 

P: W''i lOOOd 1 0 '^0 Ai i. 00r)(;)4 ;■) o (li")Ous*:d* EN 91p' Cl^Ni'lO-liJ 

F;VJ: AT r^E CH ':T C'E [)?: EA EC rd EP GB GH liM i kk hi EF KE El in MC MW NL 
OA PT CCi CE EL CE i^E liC IW 
V/: AE AL AM AT Ad AE EA BB B.; BR BY O'A liH CN C': t'E EE E?: EE E3 EI GB 
GFi i.;E GH GM HR Hii : i' IE id IE CP KE EO; ?'P Ki. EE EC E?: EF. L3 LT EU 
EV Wj I4C \^':\ ME I4V; MX Rf'' EE PL PT FX' FiJ Ei' CE SO: EI Eli CE TJ I'M TR 
TT !1A FR^ EC EE V!I Yd PA EVJ 
AE 100001 IeEO a ^'iOE04i7 (InOOiP) Cl^NO 10-10 

AF^T Wi' 100'Ji;10'.^00 Al 1 '^■;^0-F^Ell 0 F rr^^OiMlr; AE J 'iiiO : 1 1 0 i^L' A AE .iOOO-11990 

1 E ? i^ 'J 9 1 ':■ 

FF'T A'' i^'iOiiid 1 H Y; A 3a^ ed tn WO' .o;o-'. 1 r "*0k 

FEAI EC 1999-14104 9 i:^99ii614; EC R/E^-lOliOl 1-Eo09..9; EC 19a9- 117729 
l.^:^90lEd; EE i 9- ] lo r 1 ;i l9-'^0^i'a 

ic i-:m cieroie-:o 

I :S A6iP:03 9-ll ; BOl J019-00; C07B0ol-00; CllN0o5-l'_ ; C1EN007- 04 ; 
C1EU015-11; G06F019-00 



ICA ClJNOl^-l^; C12Nul5-4r'; Ci2NHl^-io 

AB WO :iO';oi8 ^n»^ a u^^ab: .:o'icn-^2: 

:-]rr/l-:i;rY - inet:.n' ^.i r^f in.i r.' ] C('dc^ri altered nucleic acicic (NAc), coiuprlsing 
pT' i.iii nq u NA s-.-qu^-n^ve ;[]A1, v-'hiicli enc^^des a polypeptide { ) , 
pr'jvid.mq c^)^io^l .ilt^-r^-d ::A ceqi.er.c^^-3 , ea::h encoding PI < -r a m.jdified form 
ot .L.t , and recoirir.' in i n'.j the '^'-don altered NA sequences ■ produce a target 
■:-rMj(:»n , altered MA which enccic-s a 3ecC'nd protein, is new. 

:)F:TA1B?:l^ [■E.:'':^Pd PTIo:: - INU'EEl^ilZNT claims; are also included fc>r tne 
f"c 1 Ic'winq : 

a l:br.:iiy L-f c-d^ n .;1^ ere^i c . n.: orvat i vely, z^-^ non-^'oriserva t iv-ly , 
rrC'difi-d IIA^ prO'::.:x:^:d by fh^:- n< ^ve 1. ine^h':-d; 

; Oi rh^' ceLl, -'e''":'.'r- i:c: v.. rus 'y^C':: p r is inq 3 :,:irg-rt ■.■odon -.iLrered NA; 

b; ) an attenuate*.! •/i::u3, pr-jducO':! r-Eic-: mbr:: m ^ a target rcidon 
a.^eied dA 'a.th j p^'r^i-m the viral genome, wn.:h rucjduc*:o: an 
inr'iune r^'-cp': nsc- ar.-cci i n : e<":t i '"-n of tne vijuis iri a riamrria I ; 

■t) ,.1 -iial v*i'.-tc^:' |:>r ij'"iuc^-d by r'^-'::':i[ibiniu.g a target c^nic:; altered NA 
with a pHi.rticTi <:-t the viral genome, where of'ticTially the vect-or 
requirec trans ci'inpl emen*: at I'^n feu; r er i l cat i oi: an-l h^s a redu-:'ed rate cf 
reveisi'jn to a rep'.l i cat i ve form, iM-inpiai ed t'l. a wild type vect^.^r; 

(h) a method c f niaki.nq a lilaiary of coaon aJ.tered !.'Ao, ■:• Mnpr.is ing 

(u) seiC'Ct irr^ a IjAi whi :-n eni^ooe^ ?!; anci 

[t ' makma --.'r-cv'-n aot^r--d KA c-qu-n:-T:s, (c.:up.r^ l s i:iu ' ne liujr^iry, ea:-n 
-ro:odi.nu 'jr r;i-'dil i'-u f^-uin f it; 



iT.e on . -a j : : o ] , w;ir :n 



■j'jMif.) ^ at l^ra^^t t, .1'., oi; l-.-J ':;vU'Mi ai{:ered I^Ac ; 

(:-;} cC'iTipccit 1 :-n c-jriprismq :::d:n aiter^rd NAc '-.^/::h of wnl-idi encode 
r 1 or a ri'jd i 1 1 - a t- -rn- '~ r : ; 

■ a) .M liLirary f c y.-.i i ro'i NA,?, conirrdaing ::-jd-jn alc-rred NAs 
■jer-VO'.! frc'ifi h'::f;.0': ton? NAc ; 

(1^'' u r.it c:urorr5ina 'ihe 1 i;:u:^ry ani a :':o.tain-^r- ^nd 

ir.at r ^>-.'ns f':,r i e-ai-KiroUiii'.g tw:^ or mtre meiiil-ers I'f the lik^rary; and 

(11; aj: integrated system, coiTir-rismg a ::onipater or computer readaole 
medium comprising a database having at least ^ artificihi homologous codon 
altered N'A sequence strmas, an:i a user interface all'jwing an user to 
s e 1 e c t i V e 1 y v i e w s e g u e n o e s t r i ri ii s 1 ri t h e d a t a Li a s e . 

ACTIVITY - None given. 

MECHANISM OE A'tTION - Vaccine. 

N S E - 'i h e me t In :> d s f t h e i . n ve n t i ■:■ n are u s e d f d r r- e ■:: omb i ri 1 n g 
CO ion-altered libraries id" nurleic aciris to j:irodU':.e new proteins 
, which have imp>rovement s in a desiraole characteristic. Target nucleic 
aci'ds inijlude thC'Se codm; f'i'r tnersipeutic proteins sU'Ch as E?0, 
insulin, peptide h'jrmones, g:ov;tn fact jrs, cytokines, 

interferons, inte r leuhins , leuoemia inhibitory facror, and 'Cncostatin M, 
3s V7ell as t ranS'^r ip't ion and e:-:p*ress ion activators and proteins 
from infe'.:tious 'Organisms for use as var.cines. The metnods can also be 
used to |:'rc>duce attenuate 1 viruses which have reduceil rates ot. reversion 
to wild type. 

DECCRI PTItiM 'OF DPJVWI N':7 ) - Tne fiyrure is a schematic diOigram of the 

human eryt hriip'jiet ir; (EPtf w.jtible sequen.::e sj:iace. 
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C04-E-C; C;>4-E'.C; A:;4-E:^t; Cca-E'ivi; CO -l - FO 1 Ots ; :o4-Fllj;)E; 
C04-N04; Cl]-:Oh; '■..:il-:Oc:E; C12-K04; CI4-;;:il; 

C-v. -:::■!-; DMt-H'iT; D05-H09; D05-Hil!32; D05-H1^E; Dc5-H12F; 
PtP-ifll ; Jd4-B01 
EPI: T:jl-.Ji)SE4P; T01-S03 
TECH MPTX: dOdJ'ibj. I 

TECHN'jLOCY FCCUo - BIOTECI-iN'dLOCY - Preferred Meth':jd: At least one of the 
codon altered Uh sequen":es djes not hyoridize to htAi under stringent 
conditions. The metnod further comprises shuffling a [lA com.prising a 
subsequence of NAl, or a substantially identical variant, with one or more 
of the c-)d-.)n alterei UAs or with the target codon altered NA. The method 
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rurthei^ comp.::"is^:io acre^iiiiiy Lhe seC''.'iid protein (r.:) iur a 
.f-^rucMira: c functional r'T-'-pert v , ^nd c r^npa r ,l: .q ':h..i.:D property 
t. (.. tnat : El. P2 pre feral. I y rias a .-■ t :"uc*: al ^ a: functional 

pi'-.perty -qui. va. ent to « a- super. L-r rn PI, L'r Fa <:;nd to: are horuolc g-ojs . The 
''i:.;li..-n altered NAo C'rinpri;^- n ] 11 ^ ary r-f ■ -(-^OMn .i.ltered rJAs, o.r a iiitrary of 

d^n ala-red o: -rise ^t.it : ■ 1 y nnaif^-o r]Aa . Tln^- c-d'ai .altered ::As ia 
d^-r.ved : ron. ftrnis ■:. f NAl, i -n.pr i a- -a ,it lea.^z \ oc>d',n alter-ea NAs . The 
0( -Irai altered HA ;-equ^'noea ■^oripa .i so -oae noi>.' 'i-L: -i^jdor. u£ .::.ae oiveiqence 
r\- r each ^'t ■^nr- cru-i.^n al-^er-a \\/\:-' <'■! '■■'''^ , 7^ ^:, oi .^ J^ ^-r nor*:- at O'tinpiarefi 
-r i:aI, rujAinal ^t diii a.v^iae div-raea:>; f.:a eaoh L ah'- codor .-ilaier^td TJAs 
aa O'jmpared to i-Al , nen-'V/e t 1 a[:p ^ n<; at a-'-i ri. ei va: I'/e .oiL aa i t at Lt-aa in each 
',: tr.e '■-■O'tr. .i.ite-a:i. :iA.:- for.ia-r-i h/A, i.iah at— a>:rt?y 

aahi id: t.-;^ it-a he-^v;e-n tae t- hj: a - a- ti'.'a a aieroi aA:. MAI, laa 

H'tai f aitat 1 t^r t:a: tae ('aj-aa'a.- lo ]:ai'Viue -jne >tr r.-.-r- ^: . f r '"-r eat ra/dr t pt.titi.i c 
ci.a- r':a:a':iuea I'tr t:. -n^'^^tit'a polypeptide -it ^:ai';p r-'-a : ae 
f:rat polypeptide. T!i- p'-::ctT.t ^a-^atity h^--rv;eer 12 .atti ai la. 
;ow^'r ttiaa tne r.er :eat i.i-atily i.-t^aa-. 'a;t ol. the OD^.i^a altare:! N/.t . Sach 
'tf the ct-dtTi altered KA aeqaeri'^ea c-:mrrr2 aea a: Lea.- ': " w-:. di 1 f rences when 
c/rrpored t't IIAI . Tt^:- i*iet h'-.ti fai^h-:-r ■::':'i'ipr i a* /t : nt r^t^ia': i ng ahe target codon 
altered IIA into a cell, v-:ct : r or viraa. 

Preferred Virus: The virua t-f !t; c-;.mT:ir: aea viraJ -.-lenenta fr-in a 

lentivirua, a h-rf-'t:^- viraa, 3r:d a:i j':ha:C -,^a-: l it :.a:.a-.i v:rus. 

Preferre-.i system: Tne ayt^.ei'i ■:. I '11' :aj:'a.';a I'lpt ; t -^a <^ri aat _-nia"e':i 

-1 laL'nu-U-'"'" id^ ayrithesi a--r ^p-erardy J,ina-d - r, tr.e a atr>uta^r or ctmr^uter 

ii-::t':iable neai..rrL, \/h.cn ayaaaesiaer :a pr a::r,tr:jat d t" ::yt:t n>.^3ia:- - one or more 

'il ig^aiucl'-'jt ; aea compria :na .:'ne ^-r rita-r ^ ab/i.-quenc^-: a jf one 'jr mere oT the 

trtificial homt.^ogt^aa at---i:-n aJtere^i MAa . 

Pr-fei:r'-'d li.rjr-:ry: Tn- ::rir.-iry ^"'i ''^ rp'r i -r^-a at i-'i^at a, IT or 

r>r- ferahd y la a aa-i-"ai ^-iL^-re'l liAa. 

T^ChrOh'jdV FOC'JS - BA:ita-.:y _ Pr-^:,^rred lad yna 1 1 ec^t : i-: NA . ea-:ai^t 
^/l■ythrop^.ll--t ^n ali.ha ;Er-j; , granal .:yt--ct'l : ay st -ita^.^^t iri :^ f s.at ji' (a-CSF) , 
a vrral enve^op-e protein, a cy tt a^n-:" , .uiid a i ivjsprirjir .^s^t . 
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AU aOOO-rAj^]8':i5 [33; WPIX 

DUN ^a^J0(:-l■^0:^^:.9 Lino (fJUOa-l.) /:-:.;aa 

Tl dt^vel qlycoprot ease polypeptides and palynu .:l-':jtidea rsc-lated 

frorrL Staohy loco ^,cus aureus, used t-t S'::reen ftr ant i oav-; ter lal compounds and 
t'j i'iiagnt)se diseases. 

DC BOA D16 SO 3 TOl 

Ii: K03t4AT?CA, A L; PALMER, L [-1; TPJ.IIII, C M; WARREIJ, R 3 
PA C^^MIK) JMITHKLltJE BEFCHAM C'JRP 
CYC !•? 

PI VJt) ^000013^] 94 Al 3'".i0t:031a (jOMOja)^ Fl] "^Jp A»5 1 K'.i 3 3-7 4 

F:W: AT BE CH CY BE BK ES FI FR t^B CR IE IT 3U MC :]L PT SE 
V/: JP 

AjT Wt) 3000013694 Al W<t 1 999-dS 1 937 19-^30^:33 
P?AI U3 1998-14 9634 l'''93n9C3 
IC I'/M A6IF:0 33-7 4 

IC3 A61K0 t&-04 ; C07K016-00; C 13:i0( J i-O'' ; Cljr]015-0a; <: L 3r]0 1 5- -i 1 ; 
':i3P031-03; '.n 3000 1-68 ; 6v.) 1 IIO V-i-^:^ - ; G06F017-30; 
G06F159-00 
AB W:;' 3 tl 00 1369.; A UPAB: 30a'.i0 5!)3 

UOVFLTY - An ist'Iated qlycoprotease (q'::p; polypeptide (I) 

s~ Leered from a polypeptide hav.aig ...t Leiat '■')t, ^a-, 30:. or ^5^ 

iientity to a 34 1 residue ammo li^il sequence, over its entire length, a 

polypeptide comprisin-'j "he 34 1 resi.iue sequeri'.:e, ani a 

polypeptide en'tode-:i by recomoirrant p :)3 ynu ::aeot i de ::ompri3inq the 

C'ase parr (bp; se:pjence, is aea. AIL sequences fuLly cetined in the 
sped f ication . 

3ETAILE3 BES'33 1 PT I tJI ] - lULiEPElJDE^JT CBAIMS are aLsD intTuded for the 
f 1 lot/ in J : 

■ 1) an isc-lat-d pol ynucle':>t ide (II) selected from a p tlynuc leot ide 
hjvin::j 70^, 30^, or 95^ identity tD a pol ynu tie at i de encoding (I:, a 

p-- lynutleati ie having at least 73 :., 83^, 90': or 951 identity to a 1036 bp 
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polynucleotide wnich has the k^p sequence, polynucleotide obtained 

Yj'/ s::reening a library under cvrinqent. hybridization c:onditions v;ith a 
fjrokie navin<7 the 10,"1() np sequen;:-;-, Dr a irigriient of it, a polynuc:leot ide 
enccjdinq a n-atuire polypeptide exfjreosed by tne qcp qene of 

St;af:'ny Lococ:::as aut^eu^s, and a r)0 i yni.i ::le.;)t : ^ie 3equeri::e cicunplement a ry to the 
t fieni; 

(li ;in anticxD^iy anti^-Lini-j n :■ cu" rrimunospe :,i t : c tor (1); 

{•d a :aeth-jd ol' treating ■»! ar. irpiii >/ idua 1 in need ol' enhanced 
activiM/ 'ir expi'ession c-f ^I), c^: mfjris me : 

(■a; adniiriicterinq to ihe iridividual an ay'inist t (1); or 

do proviri:o;i ^n fh.-- nd : ^ i :1 _ia i. 'Ii; .n o torrr. to el'fect production 
o f t ne 1 c t i v i r y f ■ I ) in '/ i 'lo ; 

( -1 y a ::"ieth'jd b..)L r a i n .joi individual i.i''_nj n-ed to :.nhir)it 
=:':tlvify c^r e>:ruess • 'nn vd" 'I), ■:oirTip r is inn : 

(■"i; a-d::b. rii s ter n If :in annao:ini;:^ t.':) ■ I. ; 

(ti; ailnini -^^ terinq h nicl-:.':- acid molecule f.hai: mnioics the 
e;-:press ion (11); or 

io.) a'drainis ten ri'^ a polypeptide tnat conip'or^js wicn (I) i^nr 
its Li-;iand, 3ub;^trate or r-jcep it -jr ; 

("O a merhL'd fc-r screening no identify ::cmpounds that ac:^iv6t6^ or 
tnat inhibit the function of cimpu-ising : ne :f tne 

f ■.>ll':)wi ng : 

(a) nieasuiing the tdnaing a can di 'date C'^nipound t-: {!) 'Or to the 
cells ■■■r nLemorar.es b-iorin^ (Ti a f a : ion protein of it, usi.i:? 

a larjei directly or indirectly assc-:iated wir.h the candi::ate comEi-ound; 

{]■.) i:iie asur in:: tne bindin7 a candidate ■o-oico^und to (1) c-r t.o the 
c-rills or riienibranes i^earing (I) or a fusinn protein :>f it, in Lne 
presence c-f ."i labelled c-imr^et i t cu: ; 

testiriC if t n-;: -i-ar. di da t e C'jmro.o^iil results in a sijn.al ger.erated 
b y a c 1 1 V a t i ■:) n or i nr: i t' 1 1 i n o : t n e polypep tide s i. r. i d-a t e c t i on 
systems aji'pj r^ap)r i a t e to the c-a^ls C'r ::ell raenitjranes ;jearing (I); 

{ :i] mixing a oandid-aoe iijiixo^und wiori a ojlutin:a a-jiita i;d. ng (I), 
measuring activity of the polypeptide in the mixture, and 
C'jmp)aring tne activity t-: a standar^d; 

(e) detecting tne effect cf a candidate ccnuri'jund on the proauction of 
mRNA ei'icoain'.j (I) arid (I) m -.^ells, using' e.^. 'rOZyiue linked iiuu ^nc soir Lian t 
assay (ELISA) ; or 

if) contacting a ccmpC'S i t ion comprising (1) wi : h :he ccaip'ound to be 
screened under conditlL^ns p^ermitting interaction hierween the compo^und and 
the polypeptide tC' assess the inter act ic>n of .a ■:njmpO'Und, the 
interaction being associated v/ith a secc-nd c-: mp-jnen*: ^apatile 'if p-rc^viding 
a detectat'le signal in response to the interact i^ia -if the 
polypeptide with the cr^mtaound, amd determining if the comp>ound 
interacts with and activates or inhibits an activity cd' (I) l:y detecting 
the f>resence or at^sence of a signal generated fiiam tne inteiastion of the 
C'.'mpiound with (1); 

(o) an agc'iiist o-r an antagcoiist cf the a-rtivity or expressicTi o-f (1); 

(7) an exp'ressi'jn systen. iric-mprising a po. 1 yiiucb-ot ide caioaljle -af 
producing (I) when the systen'. is in a cC'iripiat i hd*-- hC'St cell; 

(-) a hC'St cell ccmprising the expression system f (8), c^r a 
m-.-mbrane c^f the cell expiressing ;1); 

{'■r^j a prC'Cess fcr prC'ducinu {I), oon.prisinq culturino a nost ceil of 

Cd; 

(10) a r-rocess fo-r pr'jcucing a hcst c^vl I comprising the expvression 
system ■af (3' or a memt-r ane tne cell exi-)ressing (I), co-mp'rising 

t ransf LU-ming c-r trans fee ting a cell witn an exr'iession system ccmprisino^ a 
p'? lynucleiat ide car-able C' f priducmg (I) wh^^n p'resent in a host cell; 

(11) a nc'St cell prsMduce'd by the p-rc'Coss cf {Hi or a m.embrane of the 
c- 11 expiress ing ( I ) ; 

(iij a computer readable me'diun ontc vdiich is st'jred a memt'er 
selected frc'm: 

(a- a p'Olynuc ieC't ide oc^mp'rising the Ijz-:) r^p ; 

(b, a polypeptide compiising the 341 ciminc- acid sequence; 

(-:; a set of pialynucleot ide sequences, at least O'ne which comprises 
the 1016 h-p sequence; 



■d) a set of polypeptide seqi^eiices, ci*. lt-c.:st oii^- ot vvh^ci: 
C'MTipr sea tY.e 341 amines acid sequence; 

(-) a Ciata set represent inq a f 'CvL ;y nuc.lec^t j ce ::equenc<:S C'jrnpr i h i nq the 
l-[--2 6 L'p seqi;ence; an i 

a oata set represent inq a f'*: 1 ynuc Lec t (:ie ;:;equenc<;i erLCodmg a 
polypeptide sequence cC'ru^^r rs ina the i 1 amin':- <iCi.d se^quenc^e ; 

(13] a L'c:uputer bas-d i'ieth---i f<:.i perfc-rminq h(jmo:.'jgy idont i f i cat ic^n, 
c onip'r.i siriq }:rovidinq a p'jiynuci^-: t iae sequence 'ri^^nijr 1.;^ i nq the 102t bp 
c-querce m a corpputer r^ari,--w.)l e rnediarn, an^:; 'r-ornja r.i ng :he p'j I ynuc 1 ec^ t ide 
S'-quer::-e to at least c ne yr.U'- . e-D^ .: de or pc [ yr--p^ id^i: sequence to 
1 :ior.z X ty Iv.-iiyjl o^-yy ; and 

■:c.mputer ras--c r[^--:n. : b: .r p:)ly:-.u : b *: Ide .issembb/, cr^mpr i s i ng 
[-■r ovic: ::q a pe.l.ynucleL : id-: s-.-qu^rnc^;:' cc>r.pris:_nq be bD.b;- bp sequence, in a 
co:uf'Ut-r reaaabie medzum, an-u S':u-e';;!n : n-:; tor at b'rast ^;ine C'V-r lapp) i nc 
ie::ic^ri h^et\-;H>Er. the o :'b '/nucle-t ice :^^e^:ruence anc i secor-:; icjI ynucl et-"^ i de 
sequence . 

ACTIVITY - Ant iii'^cteriol ; .■ y" 'riS t at ic ; un^ialcer; in^i a rt hr'i t i c ; 
r-ulnerary; i njT'.unC'Supr' res s we . 

f^Ri'^HAd I OF ACTbjM - Ttie pr':re:n is i :: 1 y :'':ip':r-ot^"^ase . 

bbE - The p i'lyn jcb^-tt Ides jr:':i p'ld ypep: ices are used as research 
reacents fsr the dis:-ovei~y ol treatnients and didqnosti::s b:u- diseases, 
l.articuiarly huiui^n dis-ases. Tn^ polyn..:c let-: i i'--s rn^y re used as a sourse 
I f hybr i sa : i on prot.'es, t'")r ss;:eenir.q geneti: :Tiutati^:ns, und serotype, 
or:d f'tr 'jr^jani SJT: chrC'rijjS-siTLe ider;'. i : ivrat ion . Th^- psl yp-rf^t^d-s can tie used 
r ■:. pr^j-iu-e ant ir^ : cies , in -/:i.i::in'- r i riiiub:: ^ ior.s , ^ci ic-rntiiy .^gic-brtc and 
arit agonists, and to screen for antibacterial druqs. These are used to 
prevent, innirit or ::rea!: -iiS'-^ses, ii^rt iinl ir ly it H-r I i'.:ujij a ::te r p.ybiri 
1 n t e-:d- v..ns, such as gast ri-mtes : i::a 1 car t in :::![ a , ■:rasrri:: u] :.-:rs, an-i 
gastritis, diseases d ■ iigni-sed, '.reated and p re\'ent - y r rie pri'duits 
:n^dU'-:--^ ■: .s t e ir.V'O 1 i 1 1 s , sep'tii ar:r:ri^LS, set^i ' 'di )-[;;■: old > i: i t i s , Mute 
ta-teii.:.! eni:::caiditis, b t iphyl: : c s - 1 fo:ci ojis :nin-;, s-ai:l-d stir. 
syndr.:/:ri^, aii-J. t-jxi:: shocf: synar^mi. The pidyoefA i :l'^-s ::-an alsi he used to 
treat w-iunds ari'i body iniLdarits I..) pi'-verx i:)aiterial i^:;rles I'ln and 
m feet ion . 

ADVANTAGE - The frequency ^f S t apbiyloc': c s j 1 iri f e':t ions nas iiisen 
■iramat ically, and isolate^i S. aureus strains are increasi ngly resistant to 
staniiard antibiotics. Tne gi ycor: r.:.t ease {gcpO pi~odu:::ts can be used screen 
for nevvT antibacterial comp'Ounds that may target these resist. ant bacteria, 
['wq. 0/ U 

FS CPl E?I 

FA AB; DCtJ 

MC CPZ : E?0'1-C01G; B04-Eii2E; B04-E0;:;; &04-F0100E; B04-t;0?; B04-b05C0E; 
Bll-C0'/A4; Bll-COyEt; B12-K04A; B12-K04F; 

Bl<^-A01; B14-A01A; B14-C0.^; B14-E0&/ B14-E10B; &:4-<::0j; B14-N17A; 
B14-U17B; B14-S0t; [-Ot-CC'b:; DOf.-HOl; ['05-H07; ['-S-HQ:-:; 
D05-H09; D05-H11; D05-Hli; Z'OS-Hl^E; D05-H14; LKi 5 -H 1 7 A3 ; 
00 t-Hl& 

E?I : oO'3-E14H4 ; S03-E14HL ; TiJl-d'.ltB 
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Au 2(:)00-j04^r^7 [18] wpi:-: 

DNN n20C'0-l':'2209 DNC C200'J-f>; ':04 1 

TI :;atab":i3e system tor st'jring bi'M:iolecular se-:pience information in manner 

which permits sequen'::es to l:.)e -a t a Lcx-jued an:i seauche:! ac'iordmg to one or 
mijre protein functxon hi er a r : hi v-s . 

Dd BC4 Ld') TOl 

IN A?:ERBLc>M, I E; ALTUS, C M; Ad-'btdNG, J; HILLMAN, J L; P^LINGLER, T M; 
t:ASLY!], T J; RUSSO, F; SEIbHAHEr, J J 

PA :incy-N:i incvte p:tafm iNd 

CYC 1 

pi dC (d;-.L3o5'^ A 2 0 0-.)020o (2 00''d-3;" J 9p GO 6 r 007-06 

ACT bG (-'[2365,^ A Provisional UG 1 0 -28284 19961010, Provisional US 1096-32563 

I-.O^d^'12, NG 1 397-8 12.': 90 19 9"03-' 6 
PRAI dG 19^^7-812290 19970306; UG i:-9o-28284 19961010; US 1 996-32563 

1 b361212 
IG IGM G06F007-08 
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[■ ]■'■■■/ E L T Y - At least s : :i i o c : ' t \\ e s e q u e r i c es .In t h e d a r <_u b a s e a r e r o u p e ci i Ti t. o 
a tirs\ r.Lerarchy ■■: protein function c^ttiqoiie: : 
srecifyii.g bi'j i oqi .;a 1 functions oz proteins 

<::■: rr esf; cmding to the se-:;j-nr--3 . The f:Lrst hierarchy includer first and 
^;econi s^-^rs of :'at '-q-: ri eo :?i:--'C i f y mg functions <it a celiulai 
level and at a le\--: :ib-:v-^ r ::e --Alulcir level r^ -spec t i vely . A user 
interfae- all':»ws a u:::er r selectively view the in rorrri.;;t iiTi , 

r'F:TAILE2' CESCr.I F TI '2Ii - The sequences may incl ice nucleic acid 
yequenceo and aniin .. acid ceaaenres. Peci^rds .ncli^da descriptive 
infC'r:tiati fr-ini an e>:*:err.al datatjase t'^ wnich tne Lo:nr:.x'rCular sequences 
are :-:rr-lacel. Tn-- inaL i^Zjli^-is- mi:y oe Gen^av^K. In*? c>:'Ccn.:.i ;:er of 

(• ^re ;i':-i ies srecifi-s -;i;-al functions it an s rcinisir. i 

tissue level^ At. l-^^ist .-\::ie -f tne 1: iom-: le :u ■ ar seq_:.en::es are grouped into 

a sO'SC'nd nit-rni'Stiy j\ protein function sa'^eg >r ies . 

T.h-^'S':- .T^ecify either '".n^;- r.^decular or enzyma': L^:: functions of 

proteins cor r espc-n-.i i r.g trie se :;i^enoes . Tne interff^^e all'iws tne 

user t'j selectively viev; information regarling a suii-set - r-.e sequences 

which is qrc.utred ir: soleited cate^iries from the first and second 

hierarchies, or sequerice rei;ord3 associated witn a sa^egory selected from 

t ne first hierarchy, cr ^Tategsries i r^om tne first nietarchy that are 

ciSS'i ::iatei v/itn a seje-^r'-d s-quen-e r--S'..r.i. 3ea::::he.:. may ;TLade using 

t:i'th tne presence ■■-f keyv; i!':is an; *. h- alis-:;nce or ant i - keyv/.irds . 

I!ir'EVE>i['EMT ThAIM s ir^slud-el I '-r a metU'Dd and simjV'uror readaole 
ri-;:di uiu 1 'j- L usin:| a :':-mr:'Uter t. :> j:reser.'; miDrmatim o.! jm -z'SU i ai 
sequence rec^i^ds. 

doE - The sys-err, stir^es ana retrieves h iol :>-ji - 1 :.n f cu-m a t ion . It may 
y.r. used ti- Cat al'jLiue ar\\ra] s-'qu-n"^es, e.j. namari, primaie, r-oient, 
ar^ohirnan, insect, etc., riant: s-^quences, ar: :i m:::r'joi.al se:^uences. The 
svstem n.ay ia- used t 'rSt:ii:'lish a chA profile fsr a giver. : issue 
,,nd to evala.^t-e c;. iu :;o;:. :j^:no ..nr re ss ion caus-?:l by :;isi-.se progression, 
}ih a r m a co 1 g i ■ a 1 t r e a t. me r 1 1 , a q i n a , etc. 

ADVAIITAOE - 'Ihe syst -m aliC'Ws sequence int.jrmation to r;e catalogued 
and se.zirched rapidly ai'C'Ordmg to one or more protein 
function hierarchies. 

DESCF'JPTXON C'F H'RA.WT IIG ( S ) - Tr.e drawing sh^-ws a block diagram C'f 
client-server Intranet fcu" ^■rovidmg database s^i^vices. 
I'VKj. 
FS CF'I EPI 
FA AP; GI; DCH 

MC CPI: B04-Bn3C/ B04-C01; P(")4-E01; B04-N04; B1I-CO8; Bll-COBE-l; 
Bll-':09; B12-K04A3; B12-K04E; Ei05-}102; 
D05-H09; D05-H:2; DOi-tlieA; DC5-J 
EPI : TOI-EOIA; T01-J0^3B:B; Tijl-JOSB^M; T01-S03 
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TI C'etermming function of protein c^r polypeptide 

■iomams, useful fcu' ^iru';! discovery for proteins mvolv-ed m 
neop'last ic, metabr li r, n-urL--.egener,jt ive, ca rdiovas^cui ar , p- ychiatric, 
inf lammar oiry and : nf e^rt 1 ■'■us disc^rders. 
DC b'"'4 ['16 TOl 

IN ANDEr30N, :1 ; HdAN'"-, V; [-MnTETI'jNE , G 
PA i^UTF) 01:17 FdTGEP.G 2TATP NEV; JEF.SE':^ 
CYC 80 

PI W' ■ 2:;00vC.'54 14 Al 1. 0 2 ' .'3 ( 1 C.kjC 1 1.) ^ EN 71r: Cll'lt" l-'::3 

F.W: AT BE CM CV PE I'K EA E2 Fl Fr (^B GH GM GP IE IT KF L2 LU KC MW NL 
1-A PT 31' SE FT. 2 2 GG 2W 
V;: AE AL AM AI AT A2 BA BP BG B2 FY GA CH GN Cf PE Jb EL l3 tl bB 
GD GE GH GH HP Ho IP IL IN IF GP KE FG Fl FP F2 LG LF LP LS LT LU 
LV 'AL- MG MF hWl MV; MX NO UP P2 PT FT' PU GP SF G''; SI SF PL TJ TM TR 
TT GA GG UP UP GN yn ZA 2VJ 
A'l -^viPllSS A 20jt0214 (2i-j02:^) t:i2t;!-i-6& 
APT VJi 2fj000(i5414 Al \\0 1923-nGlu417 1339J721; AG P^51i5t A AU 1999-^115^ 



FOT AU ^*;^5I155 A Based on W' i 2 >: J J -i 1 4 
PRAI U;: r.:^^^-l 81601 19:^n:-9; m2 1^9:-9364I 

ic iCM (::i2ooo:-^8 

I':3 C07K013-00; 01 2 PO 1 .^-34 ; G06F017-00 

AB Wj 2 20') 'J '■4 14 A UPAB: 20-^0iJ617 

N2)VF2;/rY - A rii*:-thC'd for let.^ iTri i r: i r. a i:'i'?ch^:imi-:-al function of a 

proteins or polypeptide dc.^Kiain co:\i^-ricOz i':lent:. i f ying a 

stciTJie polypeptide d'jia.nLn and k'j, -mL- a r :'-i:iim-:-ns ional (3D) 

o r .fa':-^ ur-r- da t -i-iTiii ne-:! r^y ni^olozr iti i a net rosonan^o (NMR) to a known 

str'-jcz^ire , nev/. 

rp J ^ r I £ 2 ' " T T I ' 'M - A '~ 'J 1 h 1 a' ~ t n r C' ; o' !~aj a' 0 ria'-r the d f c r d & t o rrri i ri i r. a 
a bi ijciieiT.i o 1 1 function ' f .3 protein c^r 
polypeptide o.':rT.ain uriKr.'jwn function 'rC'injor i .res : 

{a- identifyina a V'a^at.ive polypeptide a^ma in that p reaper ly 
f':dii;:- mt-j a stac-le polypeptide aorr.ain, the stahde 
polypeptide having a ae lined :'L' st r u r.t ur e ; 

(b) deterrr.inin'j :-D s^aructure of the stahde polypeptide 
d':-niain fr'om an auttif-ated r:inalysis of MMP, S]jea-t rainet er spfrctra f the 
polypeptide 'i'Mtain, where the aut'an,ated analy.ris is cc^nducted by a 
N0iF2oYAs s i an p-r ocess ; 

{o; ooiQfaring the '2et ^rmi ned IL structure c- f the stahde 
polypeptide dcrrvain t: Kr.:v;n ol' structure's, in a protein 

'iita tl.tinr:, wheie the t^Miiriai i sst. identifies kn':wn structures witnin the 
protein Liata o^:xn^. K:\^o -iie n^jiLC i c jl u i t^' tne ueLeiiiuned ITj 
structure ; and 

{r{; cc-rrelating a id c-chemi ca 1 function -tc- rresrMsndi no to the 

Loent i f 1 ^rd h.-MuC'i'Sg-^us s *: t U':-.t u r e Z'.- o h-i L-:-henii .-ai function for 

the stiibie polypeptide 'iC'inain. 

IMIiEPSaT'ENT COAIMS are als-t included for tne fcdl-^winc: 

( 1 ."' i t g ^ 3i ^a t^iu f r ^^ardd I'li^"*' e n i'^'-s^ icai '"d a b i 0'i''"'errd cal 

function of a protein or protein a^rriain of 

unknown function coiup'! i s ing : 

(a) a first computer algoritnn. capahde C'f parsing the target 
P'aiynucleot ide (tPM) into- at least one putative domain encc-ding region; 

(b) a iiesignated iab for expressing the t'Utative dom^ain; 

(cj a UK?. sC'e^rt romet er f<:-r det eiiuining i:idivio:ial spdn rescinances of 

amino acids of the putative domains- 
id) a data collecti<tn device capable of roiiecting NMR spectral data, 

where the data collection deviate is 'jp>erati ve ly tc-uinied to the NMR 

sp^ectromet er ; 

(e) at least tTie compiuter; 

(f) a second ccmputer algorithm cap-abie assigning individual spin 
resonances to individual amino acids ot a polypeptide; 

{gj a third com.puter algoritnin capakde of determining tertiary 
structure of a polypeptide, where the polypeptide nas 
had resonances assigned to indiviaual aminc' a:ids of the 
polypeptide ; 

(n) a database, where sta'red wiohin the ■iiBtHC'ase is inf'jrmation about 
the structure and function of knC'wn proteins ana 
■iet^^'rmined proteins; and 

(1) a fourtn :^.omputer alg^arithni tap>able ot determining 3D structure 
n'jm'jlo^iy hie t ween the determineLi 3lI str-u::ture ot a polypeptide of 
unknown function to 3D structure ot a protein of known 

function, wner^e tne protein -af ."ino-v/n structur-e is sts)i-ed 

witnin t.te protein d^^tabase, where the fourth ramputer alaC'ritmn 

determrnes the structure by an £iut'jmated NO)ESVAssiqn process; and 

(2) a high-throughput methj:i for determining a bioshemical 
function of a polypeptide ot unknawn function 

eriC'jdei by a tPN cjmprising: 

(a) identifying at least -ane putative polypeptide d-jmain 
encoding reai'jn of tne tPN (parsing) ; 

(b) exc>ressina the piutative polypeptide domain; 

(a) determining if the expressed putative polypeptide 
domain f^arms a staole polypeptide domain with a defined 3D 
structure (trapping); 



( i] icr -rinir.irig the "'D structure; of the : •:abiL, polypeptide 
Jornain by an autDmatei MOiv:' YA^^i; i :^n process; 

;e) ccimp)3ririg trie i-:- ^ rmi ne i M) s t ruc:t.urf;^ ot the stal:)le 
polypeptide ■:i.)main t^") knev/r. 3D structures m ci Protein 

j.:tta Bank to de t':;iuTiine vjrr^rner any such krpjwn stiuctures are homcic)gc)US to 
trie dete ruT\i ne-i structure; and 

(f) C'Cr re Lat ing :i hi ■ ^ ::cienii ;:al function cr. r i -^srcsn iiri'':^ tj the 
n Dm'jlog'jus 3t:"ucture t'j i; :.>..'jsnei[ucuiJ, function t'jr rhe stable 
polypeptide ci'Cma i n . 

'.ISE - Th'"' nic: n'jds h- 'r:''?:! t cr e ^ u ::i da * i r,-g :h-^ function 

proteins arid protein d .'rL.--i i ris , p rt icuia ri'_,' rsr dru i 
::: s ':'::.) very p' j r^pce;-:- :■ . 7n-'y -.-r: ^'C ise:i i'jc :i-jia-r:jn.: fr-rr, proteins 
>-netica..ly i -Lr* 1 i a: ed i :\ :\e :'p L:i.:- 1 :c::, :net a;:' i .: , ne Jic; d'^.i^j-^rnera ti-^e , 
: ir'isic'vascuiar , psy-:ud- it :c. .jr-i :.nfd. amniat.-:)i y aiS'jrdei s, suc:n as 
AlzheiTner's tj-: ^ peptide p:^-';;arc:: >r protein c-'.r'P), s^r 
.l.ima.i.r-3 r ro:n proteins is s slc: a t e^:! witri inf e'::r. i sus a:(en'Cr, e.u. 
::.uc:erii;, funui an:! virusv-s. 

FS EPI 
FA Ab; DCII 

MC :;FI: Bh';-M()J; Bli-'^O-iA; B12-K04A; D05-H09; [M,t-H12; 
Oh ^-H 1-1; L'0^-H17 
: Tul- ,T 

TSCH npT>:: 2 0000 :: ;,;) 

TFCnt':!OhOMyr FO^'^^tt - F^OTF-OkiO'jLOm^V - ?refeii--o : ao).;i : i'he nc vel m.-thod 
rurther comprises the pr--step> or pEsusin'j a tar-.^ei p- oi ynucd eot ide intc at 
I'oast lor.--- pu*:at-io7•^ polypeptide -d.-msin. Tl-:': larsinu is p'tor fh rme d 
be- a fir.- t .; mu^ ut -i-x :i ic :■ i ± t lea s-- 1 e-:.: : ed fi mu a ■.: o:tip ..oei .:ii-:r: i i t hin c.:^;^-)-^^-^!^ 
o: det t^rniinin :4 excC; phase l:-':'uniluries of a p^id yn a^:- leot i de anoi a c^'iinpiuter 
'ziqrrizYdh cap-atd^ -id d-^r lurc n i n ;r ir.*" er 1 ^na i I'l ;:''ord^ri-s -ncjd'E;i :n a 
P"' 1 ynu'": I 'rOt 1 d" . Tn- ioe'f:h'OC luitn-.-r -^-C'lupr i so s ^ inr:>ut ■;; r bjo>r i. t hn which 
■::rnpares tne r.utative polypeptide domain sequcr.ce \;bti: kr.O'wn 
*i-.M[i.din sei.pa^nC'ro st'.ue'.i m a Li.::^ t ao-zi^e . Th- Mi-'O sp>e.'tro; .iiie- .::;iiaiyceG i^y a 
second c^Miiputer aiaorithra which aut omat i sa i 1 y assi'Jins reso-nance 
assignments to the polypeptide sequence. The i ''.lent i f i I'at i 'jH sr.ep 
co.mp'rises measurino^ a tiroe C'Ourse oi amiae h^-dr^i' ^en-ideut e r 1 urn ex'.:hange . 
Prioi tc the ^.eoiC'iiO st^p' tl"i*:r polypeptide is C'S't iiual 

S'".dubi 1 i zed, c-Mnp'ri s i n^-j p'rep'aring an arrii^' of nd cr-odiaiys i s butto-ns, each 
containing at least lmr',^r'C-l (sf an afip^roxiniat •? 1 y ImM S':iutiC'r; the 
polypeptide doiTiam, dialycing the members of the array a^"jainst a 
i.aiflerent dialysis buffer, analyzing the icialyced b'Utcons tC' determine if 
the polypeptide diomam has remaine'i sc-lubde, and selO'Vcing the 
d'omain having ■I'pitimum so'luh-i li ty chara-ster i st i cs f^or I-'MF. sp-ectroscopy . The 
c<jmparis-jn -jf the :'oO structures is p^er formed by a third <romputer 
algorithm, oreferatiy DALI, CATH or VAST, .rap^akde c-f oetermming 3D 
structure h'Smcdogy t-etween the deterEiined oiC' structure ana a member O'f the 
protein <iata b^ank (PC'B). 
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hU 2u0i;;-130'' 97 [12] WFIX 

DNN ;JsO(::G-li:i^313 FOJC '^OijOh-O^ 1-90 

T2 Ari efficient, aco^urate 3:nc rap-i'a ce^mftuter dat.ahaso^ for <:-st imat in';; 

protein functions e . ■;: . e:i syniat i ■:• artivity, fc.r 

polypeptides -.'L't aine'U tr'M:i --jene segv.en -:e t r.5ins la t io^n . 
DC Bn4 cgt:) tOI 

Ih :MAPniF;A, M; ITAI, A; ITAI, r.; ToO'!I00-;A, N 

PA iMEId-U) IHOT I-lEl-JC I dAh MioFFCODA:-. FFSD3N i:;iO 

CYC 

PI V/i) 09620j^ A1 19091202 fsOOOlc)* JA 16\:' GO-F0'17-3j — 

F.W: AT 3F ^OH oY FF DF FA FC FT FF G3 'OH '^-F-: GF, IF 10^ FF LG LU MC MW NF 
<jA PT GF' of GO- F'G 2W 
W: Ah AM AT AM AF BA FB BG BF. BY GA GH GN GF G2 2'F DP: FF FS FI GB GE 
G:l G[-l GW Ht) ID IL IS F:F Fi : FF F:Z LG 2F FF h.: GG LL' L7 MD MG MF MM 
MW Mi: UG UZ FF FT FG' F.F Gl' SE SG 31 OF SL TG TM TF. TT UA ')G US uZ 
VM YY FW 

Ac 987451 3 A 199'n213 (20002 j) G0uF017-3C 



ADT WCj 9'.^6^0i'14 A1 WO 1 9 y 8 - J F ^ U;"' I9'^y0b.i6; AU ':*H"/45ii AU i9r^b-74 5 iJ 
1 ■^98('5;:'^, WO^ 19?'8-JP2:0. :9'^P05;'6 

FLiT A'J 9^^74'. r-: A Based c-n W' - ■^9^.;. OU', 

PFAI W'' 1 ';^'^X-Jr-\^ ?02 l'^^".;;9^h 

IC r''M G06F017-30 

AP Wi ^ '''it..: i:)4 A VPAR: J i ! i li ^ 

f]<''VP]/rV - A dtit.st'a:-;^ ':':■:.*. .;iri-.nq i.nt <;)rmat: ion on t:ie amino ocid (aaj 

-quf^riOf^s ot proteins '.'i v/hi j.ch or more b ic i cjg i-oai 
functions are kncwn is riV'\/. 

ijET/^: LE:- IjESCRI PTI' I] - ^h':; iiatataoo also contains ad<:iitionai 
: ri t c rir.a': : : n on the s :^<:a;^ <-.)f :.mpc> rt ance of escY. ria residue in the whole aa 
./ '::queriL*e in oet ei'ni: n i r. 'ho ':n<::v;n L-^LlooicaL functions. 

An ■ IIL-EPEKZiSir:' CPA::-! i.- al:--o jncladed foi' a methc^d ot preparing an 
alxgnment P^etv/een aa loor.r^f^s c-L-nt ned in the datafcaae .rmc; those 
..I the unnr. 'jwn polypeptide. Thi- i.s lepiesented -iS th*E hc^mology 
,i:tC'n<;|st '.'nri'Mis si.tes, • loh. }-)eir,':: i 0'£ nt i f it;;d as navino a riioh score of 
iT'-nc rt ari'.>r m determiriini }:'i:'^.errd aJ Led oqi :a.i functions. 

nCE - The method is Pc^ enahd e an efficient es t im.a t ion of tne 
L- LolL'^^oa :. functions, r .j rf i eular iy enzyifiati: and signaling 
a-rt i vit I'.iS , of ( un ;■ knc-v/n polypeptides from their nucles-t i'lie 
s-gu^.-n'::es . Stiitat)le proteins car. then be isolate'i ana p'urif ied 
l-.v v-nrio-ns means. Thi i s ■-■■■Mid be :.'t TL-ns ider ikde ase in a ;;ii':dog ica 1 and 
m-^aj ';-ai ■■r:v.t ext . 

AL'VAllTAtiE - ?he riput er i z f.- :i pr-ioedure i ^;^fficient, fast anci 
■1- -cur at e . 
L'wg. O/i 

FS CPI EPI 

FA. A.B; PCd 

■"!■■!: B'"-1-P'"'^ BP^-P'^PA; BO ■! -P(' ' ; B04-N04; p]d.-:08; 
B12-K04E; D05-H09 

TECH UBT:-:: .:':'\:PtP>j3 

TEPK1I0P<J-:'PP Ft'CUS - ::PPTP:JME1ITAT:0N AUD T^ESTIPG - Preferre'U Database: The 
I nf c-rmati'tri contarned m the database is part i.::ular 1 y constructed from aa 
sO'iruences i-f proteins witi"; P.nov;n ba.'jlogicaL functions, 
particularly mformatior. rei.atm'j t.o the s ter eos t ructures of the 
afplicable proteins. The aatabase is incorporared into me:iia 
with meniC' r y f o r use. 

Pt^f^rred Method: A sea rot, firccedure can t-e cpe rated with tl:ie database to 
find homology between a protein m the database and unknown 
protein. A high correlation can t^e obtained P^y using sequence 
noups which contain 2 or m^jre C'lntinuous aa residues of hicjh importance 
S':0're in e:-:hiPdting the Pnoio-gical functions. A final 
determination of hO'moiC'gy and p'jssiole identity is obtained by the 
alignment of a sin'-jle protein if: the lataPase and the 
arget protein. 
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TI ^n-.timi tat I'jn c-f agents t^:- r*-'generate biosensor 3irfac(;s. 

PC Bo4 I'lP C'l-; SP3 TOl 

IN APDEP.SP-Pw P; HAEMAELAE I ! :EI] , M; MALMQVIST, M; RPOS, H 

PA ;p:AC-tn biac-jre ab 

CYi: 2 1 

PI Wo 9'^6j:.:-3 Al L 1^ 9 9 1 .c iPOC'j'Pj^ EN 131p 301NCP:7-327 

RW: AT BE CH CY L"-E [o EC FI FR GB GR IE IT LU HC NP PT SE 

W: AP TP US 

A'.i ■.'94Mtp:.-^ A Rt.Ji;-*!^!''! ■PLMyjc:.) GOIP027-327 

ADT VP.' ■^■ri6"-V:-3 Al VJO 1 :i - oE .^2 1 199.0531; AU 994bP5& A AU 1999-4 6658 19990531 

f:jT AP ■t94P^d:.9 A Based or\ 9>:i333'^ 

P?AI 'P: 1.0.--t74u2 1:)-3':P29 

T C I 1-1 G 't *; ;P7 - 327 

I'^S C:::C2':)l-00; GO IIPJ •■3-5 4 3; G06F019-00 

;CI G"'6F159 : C'C 

AB W:* 9963333 A UPAB : 2^":pU0215 



NOVELTY - A method for selectmq an optimi::ed re- lenerat ion solution ror 
tr.e i^eqenerat ion jf a tiosensnr : virface ha'/i:i::? a su r f ace-bC'Ur.d lioancl and 
an anaiytr^ associated with the ...iqand, :.s new. 

jL:TAILELi rJES':'P.l E'TION - The nieth'jd :r>mpr .s-':?s : 

(l) sequentially oontaotinq the tioieiisoi- with firs*: reqeneration 
c-:":;<tails, whiore ;v-n '^ne is m ,;':me-:'US d:: u^: I'.ri (:oiui:)ris.i n^ >ut le^st 'Sne 
aoidxo, b.iSic:, i-'n:-:q -jrqanic, d*.;ter::en\ or di-rlr:in<:| s^.c.ct soluti'jn, and 
whert- at least, one ■: 1 f:he 0'jo^:tavl.s v-cmra' ist.^.3 .j :iri:-:f:r^? of at least two of 
tr.e stoot. solat isri.- ; 

(.., ineHroarlrq : ne re ;jer.era t s-jn ott-i'-t Zjv e xcri ■■. t the co-'S kt a i. i s *: o 
de"*:erm:ne wni sh ^:l^'e tne hioh-s" n^easu r---:! req'.?:;e rat i^:. n e!'f':'L:T; 

c:Ofr*iils n.;:S/ii":q tn-;.- hiifnest r.r'-nS ..re-:i r-- ;ien' : :ri *: i- ^n elf■■■■;:^; 

f.;. ■::oni-*inL:-.':| ^ne st:ck sointi'jos in c yi:-- r^v. >s t "> :jeneriite 
ao'j^ne:- s--: rf :r 0!^^r i:: i : n s . o*. u 1 1 ,: ; 

f": /e;Tient : ally :r:oi: ao t : : . j the b i ■: sen.--:.::;- s.rfao- w::.th each -f tne 
new cco-.ktiils; and 

( ^■:) aet-rminin ^ the reqene r ^ t !■ jn eifec^ oi vMcn c^oktail, and 
iaentifyonq ■.■ne :is tne -jpiimized i~e >;.-r.era ti'.-ri ssluti':^n. 

ItlLiFPEtlLiENT CLAIMS are also rnc^u^led f-Mr tn- f o M'jw:.n-.j : 

(1) a ia-thO':i btr cha racte r i. s mq a liqanLl an^lyfe associated with a 
t ' i s n s ■ j r s a r f a ce , ■ jmp r i s i ri':^ : 

in) osnractmg the bi:sens:r s^;r t m riir:^:.n:| a i. '-^-..roi L: aand, 
sequentially with or; ara :^t e i 1 za t : on sjlutijns; 

(:.■) : n t rc'duc ire.: tr.e ar.al'/*"^;- li.n-: > -r^.iv. :i ^j-ut^jn, ~ -ntei'act v/itii the 
tioand iigand; 

(0) riieasurinq at leait '.arj jf., jss-joiat i. "tri or sS'tciato:. ^sn rates, 
sarface bounl an..ly*.e sonsent ra- on, arri re ::ener u t ion v-tte-t of tne 
ar:alyt.^- and li-?^r.d .nteraotion r ■"O' e:-:oh solution; 

(d) . :nara-:'t or .o:inq the :.,t.na jr c.nalyte ir.ai Ln-j res:..ts; and 
opto jnally 

(e) ooniparir.q *: he chara'::ter loat : 'ju '.f *. he J. i :fand -ind/sa- !inalyte 
associate'-J w:.tb tne biosensor sarfase witri a set ot pre'deterramed 
characteristics ot id.:her test m'S-OCules, and preHctrnq their activity; 
and 

(1) a qenerated data siqnal, or computer laenory ■:.:.)nta i no ng a data 
structure, useful rti^r corrmunicat ron of chen:\ic;il pert ?irL)at is'ii inforraation 
associated with an analyte-1 igand in^.eiact ion, cj^mprrsing at ^east one 
kinetic parameter, each expressed as a na th^mat i'ja:i. mj'.iel describing the 
relation between the parameter ^nd analyte- lagan 1 interact icn in 
characterization solutions, so that the data can be used to C(jmmunicate 
chemical perturfcat i^.n inf ormat i'Dn asso::iated with the anal yt e-1 igand 
mtera'StiO'n . 

USE - LiganrJ-analyte binding pair inteiactijns are used ,:.n assay 
worK. They rncluoe antibody-antigen, h<armone-n'jrmone receptcr, 
sense-ant isense polynucleotides, avidin or streptavrdin-biot in, 
enzyme-substrate or mhih^rtor, iectm-mat cnm :^ saccdra r ide , lipid c-r 
P'Olynuc-e.-:.t L le -binding c-r matching protein, 
rec^ept'.or-t ransKii r. tor , d!:U'"j taioger, protean-protein, 

L'NA-DKA, and D];A-F:]A. Th- pro-cess is use^:: t :■ break 1 l .:and- an 3 1 y t e 
inter act I'tns to r^-iv-rnerat e a surface .r^anr amma t e .i witn anal yt--, ft^r reuse 
in tne next assay. '[<any surfaces suitatd^- for -rarrymg a repaired Iraand 



tlv to 



prC'dU'-re, and must t^e re asec i l: P'::ssibie. As a n.atatvie 



example, :he biosensor surface is a goba layer, whion is eatable of 
■r^uppor tiinq surfa>:e cdasmon rescT.ance {SEoj, or rrher ■aoticaL let e.::t is.n and 
ussay techniguf.. Tne ligano is r'Ouna *:.hi.:. iuy^-r, iicefeiably insiirectly 

ugh a dextran matrix. Tne reo:very 'i-an': lae ■:o-: harsh, the llaand 
will i:.e lost. The r-cc-vered analyte may Ir^e separatee and collectea f:.r 
siiasecuent ■ath^-i- assays if reguired. Er::r. a recovery profile 
trom different scdvents as de 1 e r :riineo wi ■d'l kai'avjr. sarT'ples, the analyte tan 
alst te :har Hcter 1 z-d, even t c- c-rtect minuta -:ii f f e r-E n ;es , f-ar quality 
c-antr^:! ^-r -ath-^r r-urr-c-ses. Tne i:ateraction profile af 

li.gand-ar.alyte may iae used t :■ FTeiis: astivity :. t :he ana.:yte, t s be 
useful for screer.ing. The re genera t it n ana characterization procedures can 
be automated, us^ng a computer. The C'ampater is pr-aviaed with a readable 
medium as a flowsheet, which instracts ins trumentat icqi to prepare the 



"■irious :.;oiutions (unless, ini i a_Li y , a kit is a "cii i-iiLl^;; ) , and ioiiuw^ny 
t:rie contacting and a3S*:t^;s::ient Dr')Gcdures as t^jr ::ianual. ::onti:al. 

F3 '":PI SF-d 
FA Ab 

MC did: B:i-d')7A; Bll-dtHF:; :m-d:dd-:?; Rll-d08E5; BU-CO^^; B12-P;0^1F; D05-A02; 
ri':i5-ll':'9; OOS-HIt ; [)0-.-Hl :.; dO'5-H12DS; .."f d]-B''l 
[::?[: d^)3-F'.'4A':)S; 30 B-Kc-IB--; SM ^-F14H-1 ; 3 t-;-FMH9; TO L-J03; T01-JD6A1; 
TOI-SOIC; T01-S03 

TEdH dPT>: : .l'.'iJ-')-.Clb 

TP:::H:]<jLOdV FO':d:S - op.dAUP;' C[d::ilIJTRy - iT-f erred co-iponents : The acidic 
s .duM-jn I.-: :i niixt.ire 'C - :i:i.:: m c: : n : :d2-7, .mi M:.s pH c i: 1-7. 
3pe::ifiei -■ici'is in:-:..iid- 2-\'::m:>:. ir.C' r j ■; ni c , c:'! :1 ) jx y Li ;.ri:l suLford.c acids, 
.:i::Lino> fi:i'is, pneno^s ar.fi piirir.-^ .i':;i':is , t ne ?A-)y'^_ f'r-' r-:: ri~ed acid is a 
j-n xiur-r- ■ d' C)X-aIic, f:)n':)Sfjh-..' r i , f-^ccd '", ar:l ri:iL-:ci:.c -ii . Fne basis 

S'"*lutiC'n I S a raixtare 'id' Ijdses h iviri-j p?'--ls, ex ::mp include basic 
s.ilt:s, ar'Lr:icn Luru, i^saJ and zin:: ny :is jxides , Py ;ir axyl a-d ns' ani ny^irazine, 
t:ne nost preferred is a rnrxture --.d et. ha n<:d amine , S'^di'ini ph^jspnate, 
piperazine and '■|ly::ine. Th-- s;dv-n^: ]s pref-rahily a ruix^cre jf dimethyl 
snlfC'Xide, formamide, ethar.ol, are^nanit ri le and 1 -ii-ut .::n^:d . The detergents 
■iniDrii':, ^:ati'jriic, zv-; i t tei" i '.-n i'-^ .jn';l n'On- nirii , rn^^-st |:>re:erably CHAPS, 
Zv;itrerqent 3-11 (BTM), Tw^ren i:) {RTll) , Tv/e-r-ri 10 (P,TM), an:i I'riton X-100 

t::THn:LiTA, IL'A a:il crX'V/n etheis, :ri-:cr. f.^re t v ratj . >' EITA. .A ■jaisiinq 

S-: Lutii-ns e . . for KGl, njnp'C-lar 3ts:c siluticns may aisc ]::e included. The 
;: ha rac t er 1 zat I'jii S'lduticris ar- the :-:ii!te ^ yp-;;s (a'-L.di-.', L-asi'.:, i^jnic, 
■ jrqanic, detercent .jnd cr;e 1 u : ^ r. :: < as ^he re n-ir.e r a l i i-:: sS'lut ions an :i 
^:oc>:tai.Ls . 



TECKl;!lLt)';.Y FOdMi: - ! :!■:^^,dAd i 'dill'lidrdV - E'r-: ! -i^rr-.-l z .ir.p : nenc 3 : Ionic 
sz'luti'jns .are jzre i inarit .: y in^ i''par.i , th-[; zriranio: be.iri.T a::ezic an-i 
hal'jacet ^ 'js , arid tet ramet i"iy i a:rz:fjn luui, niC'St r.-ref eraijly iri'^tass i am 
thiocyanate, ifiaqnesium chi-i^ride, ure.^ and giLT^ni'liniuii cinl'Dr ^de . 
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IR 'VELTY - An iscvlat^-i polypeptide :A; select-.-a frfjiri: a 
polypeptide cc-ir.pr is at least 7'Y, -.'O, 90 z r 3^'' i-lentity tC' the 

719 resi'due aninij a^jid sequence :uJ ly derined in the sfjeci float icn, a 
polypeptide v;hich comprises, or is, the "19 resiaue se.;uence, and 
a polypeptide encoc^ed h'y a i^ecc mtd riant p-i'-lyn\-.cle!.jt ice :'C!mp'r is ing 
the lluO Lp L'N-A se<'juerice fui.ly defin^^d in tno sp'-^'-^i f ic "ut nan, i s nev;. 

DETAILELi DESCRIPTION' - INL'EPENT ENT CLAIMS are alsz- included for the 
f ','1 i owing : 

(1) an isolate':! p^C'lynucleot ide selected from: a p-:d ynuc Lec t ide 
comp.rising a pC'lynucleot ide sequence encc-din': (A:, a f-jlynucleot ide 
cz-mprising a sequence that has at least 70, oU, 3iJ cr 95'*. identity to a 



P : lynucleC'ti''ie sequence eri'-^v'ding the 7 19 re^^idue sequence, a 
P ■ lynuclec t id*.' sequence c "^mj- n.-; mq at. leas-^: ' u ) , 80, 9n ■: r ■■ identity to 
t r;e I'loO bp ['IIA sequence, a jr-o l ynuc Lec^ t ide ■ •■ .npris-in-j a sequence encoding 
tr:e 71'.^ resiuue sequence, a po iynuc leC't Lde wnich i s trif.: ylu-' bp DUi\ 
;:>.'que^nL:e, an : solatec po I ynu^'i cot ide cbtaineu by s-TM-ning .x I ib:, ai:~y under 
s'. ringent nybridi ::ar i.-jr. 'U-.i i r i, -ns \'^iz\\ a ru ■ b^t hav'in f f ^le .It'i;; bp UUh 
S'"-rjuen<.;e '^r a ju-a-.unent ot it, .i p'l^ i ynu<.d.e :>f i Uf ■ -n'::<'d:u(i ."i ::i.;tu::^:^ 
polypeptide expressed h-y "h- nr^dE gene i^jn" -j : r.^^'-'i in r-pt^ - -oc-;-!!;:: 
pneunon.ia^, an:i a ^-id ynu s b:-:. t id'!; se-puen::-!' ; :■ c.} d.'Ur.'iu.t.iry .:ny ^die 
.i Sij.b'it eci p'O.i ynU':;Ie':'t i'.i*-s ui.)' ■vt.;; 

(.') an antit'jay antiu^nic r-r iurui-iun'j:} p^-'C : : i ' {'■■r A); 

( :d rL-;:triju I'M' tTiO " I ■ ; < s : I' . n *: ■ d" E::: _ n ■.uor^.. i : 

{-u; in n*.ie:i 'jt enhance ^ ar'-ivity <■ r *:;Mp SL*;:noi n ^.c' 'A) :oripiu. s:.nu 
a unrnisterir.:; t^:- the in :1 1' ' : -.lUd 1 an aq^'nist * t.i;.:- peptide; '^r 
prcvidirig t (j the ^nuL\^riual an - sS'i-ji*. ed ;:u.)^ ynu-.o ^-c-^ lu- - -^U'lnpr i ::.i r.u *::he 
r.* :d '/nu':: l.e :>t I "ie se ]u*tn-::e ^u-": '■''lirp: r:::e polypeptide in t :u."m a.r 
f-::te'::r prodncr^<jr: ot tn-^ polypeptide a'::t:vi:y in -ji 

(b) having need to i.nhibit tPo activity -:)r exrvress is^n (A), 
C' .niprising, admin: s t -/rin ij t'.- th^ i.ndiviauaJ "m .ir:t ^g^- nis t tt the 
peptide, a nuclei'.: acii.i km) l-':ul- that mhilj^rs tn- expression o t: a 
p. dynu'-jleotide sequence eiics-ding: th.e polypeptide, .-r .i 
polypeptide tnat C'jinn et*:;s v/icJi 'A) r::)j:- its J;u"i:':'-b su:-'S r ra" <:>f 
recept'^^r ; 

(4) a pjro'jess fot oiia ■jr.':)s mg s^r p S'j-;in' ^ n i a :i;--;.-is- ■.u 
suS'.:ep;dd:)i 1:^ t y t 'S a .b.^J•;;:iSe -n in ^ :'i i "/ i-:iu ii ... reiut-.-d ".o -.-xpi es s . ^^n or 
a:.:t].vity of ^A) C'Dmnu' i s inq dete i-nd.n ing the p: es-'r.c*.; ■yr c^ii:'ser.-::e i: a 
nii.itation in the nu ::1 e st i :ie sequence enctcim; [h] in *. r.e genome 
o: the individual, or analysing f-or the pr-.^s-^n ';u ■;:ii.-iunt ''f ' 
•■■'Upress i ::>n ■ n a sample -lieri-.'ed tr'tin *:n'^ i noi ] ^ i .la I ; 

{Lq a niethoi f.-t sjreerisn^ zo iientify ::-:):rps.:nds th'it ::::t-"ate or 
:s.n:oi^: the function ^u; 'A- :orqp r i s i ng : 

(a) measuring tne binding a caniiidate ■njnp-jun'i t. . (/■q or t'.) cell 
membranes bearing it or a rdisiori protein tnere^jf t^y me ins oi .i 

label directly or incirectly associated witn -.he ■.-an.li.-iate ^umic^ound, or in 
the presence of a labeled competit'jr; 

(b) testing v/hether the carididate comp'jund re^-u.:.ts m n s::;unal 
generated by activati'jn or inhii:dt:on '^f (A;, usir; : '.ie te^-t i .: ri syst-;^ms 
appropriate to the cells or cell ciemoranes L'oaring (A) ; 

(c) mixing a candidate cc-mtound with a sjluti'jn contair.ing (A), to 
f'jrm a mixture, measuring activity of (A) ar.d comriarmg the :i':tivity of 
the mixture to a standard; 

(d) detecting the effect ot a candidate csmp-cund n the production of 
mFlJA encoding (A) and (A) m ceils, using for instance an RLI3A assay; or 

(e) contacting a compos it i-vn comprising (A) with the ^^ca^pound to be 
soreened under conditions to permit mte ra ::t i ^n t^etween the sjmp^und and 
(A) to assess the interaction ol. a compouni, such interaction being 
associated with a sec'Ond vnDmponent capahle or. crC'Vidmg a d^vt-rctable 
signal in response to the interasti^n of (Ai with ^.he ■romp'jun:!, and 
determining whether the '.rc-mpt-un .i irtteracts w^th ani a--.:tivates or inhibits 
an a'Ctivity of (A: by detecting the C)resence -ml at:)se;o:;e -^f a si^qnal 
generated from the intera-rtion of t.he -rx^mpoun .i w:i th (A); 

(6) an agonist tu- antag:-nist of the acti-/ity )r expression -jf (A); 

(/} an expressi'jn system C''m[)risinq a p'j l yni:c ■ et' t ide capible of 
producing (A) when t.he exrressi'jn system is presc^rit in a c jmL^j.^it ible nC'St 
cell; 

1^) a n:)St c^-11 '::omc rising tlie express systeir. ^^f (7) -ji- a membrane 
:■: It expressing (A) ; 

(9) a prC':ess f-:r pr-iducing (A) c-:mprising the step of cuiturin-::^ the 
nost cell of ( ) under cc^nditi^ns sufficient f':.r the or D'.iuct ion of (A); 

(10) a princess ^Jr producing a nost celi. ^-omprisin^ the e:-:press itin 
system of (7) ■: r a membrane ot it expressinj (Aj c^m^irismg the step of 
transforming or t rans f ect in g a tell with an expressi:n systeni :omprising a 
p-jlynuclecr ide cap^able cf prj::using (A; when tne expression system is 
present in a ctmpatiiile h-tst cell, under aoprorriate -s^ndit ions , produces 
(A) ; 

(11) a hC'St cell produced by the process :f (10) -sr a membrane 



thereof expressing (A) ; 

fl:"!) cor.f'unei" rreadable me'lium havin'.f stored Lhereon: a 
}■ 1 . \' n u c 1 e ' ^. i d e > • ':)iTip i ■ i a i n the 1 • 0 t p L'l N A . e q u e n c e , j po lypep t i de 
.:--ni{:'risinM trie '/ 1 9 residue sequeace, a set of prtlynucleC't irJe sequence 
v/h*;;ie at least ltio C' t the sequen-.ies c^inprises the 21*^^0 t-p oMA sequence, a 
s-'-t ijf polypeptide s* ■■.ruencf-s wnei^e at ItOiS': on*- ot the t:eq-..:ences 
' •'■■mj:' r i ses i" he '/ ; 9 r^:-;-.- 1 'due seqi^'nt:e, a c:<ita 3(": t l ep r e; -ent i riq a 
p-t 1 ynucle'.'t ide sequen-^:; c: iiipr isin t h^- 2Itt \: l'NA Sfqu^i-nc-, and encoding 
.u polypeptide s-:juenc.-; cMtpi larr^j the 7 i iesi<:i.e sequence; 

(lo) a '::c^rTif:-ute:' hn-:^:i:i n.ethed fru" |te r f ^^rrd nc^ h«:>m(. i^'gy identification, 
.-■■uupu- is: nq providing; a r c 1 ynu'::ieta ice se^-p-^^^':"'^- -tomprasinq * he 1:160 \: p LiNA 
.^•.•■:{uer[C^r :r: a input *j r .r t-^a.iO^^^ is-;.-d.i ur. isp,i; .s.i :\q th-E- pt'._ynucieo: ide 

;■ -'quence " at ^east '. r.-- po 1 ynucd eo: r<";*E -.-r- polypeptide s-:q .:enc-- 
t loentify ht-nt'ltqy; ano 

^14) cc-kl;:' .Iter r'as*-:: it'-thC'U : <: r pn.. 1 /; .ucP m: - .-e Eonol y, the n.ethod 
-O/iTip'risrnq pro^.'idmu rirs^ p-1 yrsr:-^-ot o :e sequenv/e conutrisina the 216(1 
t'Tj L'TJA sequence iri .j ctr'ip^it e r r e i-:huh'l- ri-'tiun., ,r < u-^-tu . nq f'jr at least 

<tne C'veriap'P'inq r-qiL^n h-etv/eer; toe first p lynucl et t i^le sequence and a 
s-::ciid p.ijl ynucl^-et idtv svquen 

Ac:tIV1TY - Antibiotic; ant i nf laruriat 'Ory ; audit':: ry; antiulcer. 

MF.(:HAiI13M of ACTI'j:J - Vac'-ir:e. 

U2" - Ihe r.rdE jr~iK-^:.c-s n.^y 1.)^- use<:: m a ^ii .u'tn^ s t i c tht>d fo:i- 
diacrnjcis t'f dise.::S-, s^.-.i:Mn;i •'t ;t:se--ise - r :t:esr:crise 'ti ari mfectlo'is 
■■ rqanisiti tt^ dru-js . Th^ se:pier::>-s a^e pr-E' r -u at' 1 y ucr-'i ^ ,-ii>,,:nrse ba<-terial 
I nf ect icris , r) .iru. i :su 1 a i I y ^.h-js-j 'SMu;--d h/ 2 1 rep t i ::t' tC'..s riieuii.oniae . I'he 
se';^uences rriay also oe used to prevent He 1 1 ■:;cb'ci..d. -i' r oyl'.UL LnfectiC'n, and 
otoitiacn C:incerq ulcers an:i gastritis .^aus-r-l hy H. py.tii :.r^:ection. They 
::.iy alst he use'l to tieid.. 'iisenS^s tause:: :-r i>d;it:-d tc intt- tt ion by 
}" :-t r i , i r:C h.i'l tng '.^ i " i s rneii.a, c : n un "^^ i v: t i s , pn'--ur. inn -i , t-act e r--ni i a , 
ri.'jrdnai t is , sinusitis, pleuial ';.-::ipyeia j , .xi\a •aroli -j r ct: t : s . 

AhVAhlAdF. - tlon^ qi--n. 
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TI Three =:ii:aens ional coiiLputer iriodelmq to i.:ientify HLA ;:undinq tompounds 

useful for moduiatiriq irririune responses. 
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MOVEhTY - [■1eth-:)ds ;A.^ f'-»r iienr^fyinq and usinq HLA ( Histocimpat abi 1 ity 
Lymph'tcyte-A System) binling to^npounls as HLA-aqonists and antagonists, 



comprising the comput at lonai prDcessmg ot a d^^tat^ase containing 

iimensi-'ina... rt:ri;-:tures of rt-copior- sites ciroi f":[ieriLZ .1 compound:^, aro 
r . e w . 

DE"'A:LEL' r'SSCT IPTI'jN - M^rtncd {hi oi |: recessing a -omp^jund cata base 
c:.ntainina 3-d.. mer;; ^ iona I s tru'.-t ai es cr chenical CMnpounds \o provide a 
:'rj(i c-^]:ip--'un':; rapat. -t bl'-^<::'.ing .a cecept-.r aid:e i r: a ri^s^ it;c^ Le'i'ule 
' •■:':ri|>r i ■■■ i r:-\i : 

{!) r/'delm'-j ^- he : -c im-n;-- i 'jr: ei I strictare of ^ he re^ept-jr site; 

(i; p...£^tio:ii;.g a ■:■ i^iup'^md I re^ni the i,.;ta t:.a3e in th^E recept jr site 
and assigning a 'i*:- -met r i :-.^] - f: 1 1 ^S'lr--, indicating tee fit l.^etwe-En the 
r:n\]-.'Cu:-A ^nd th^ receptor site; 

an:i fcrnmj a or .eg" cf '■■triip ".end r v/it.r. a r^r.k ot a p.ss-det eiT.i. n- :i valae, or 

n; ■ :• J im: z : r.'.: t;r. en-:'i'.iy function :ies rii) incj ict e isa^jt .. tn;:: 
^ .:.t. ; : > r ' ip ■ u n c ^r^A 5:. re vept'ti- si.te v:!; i.: t i.ng SO'S rcii na t-s :i the 
c smr^'i'im J to obta:.:. a mmiiriun -nertj/ :M'mpc and-n^jst m^jlecuje' :::oi;ipi':n 
r rM'-'T iir^e ; 

ft) L.L;nking t Ite somp-junds accMciin^ tne..r minimuni energy \^aiues 
arid fC'rmina a suT'croup of ct-mp-ounds with a mini iiiuni-ene rgy rEine of a 
predetermine':! valu- or- nivjh-r; and 

(*■■) visualizing (c-r. a :rip>ute r ) a minimum (^ner::y C'^mp'-oan-i-n'SS t 
rioi-cuie -^ompie:": anc fcrniiny a sessni sat;roiip nt c-nptands witr: a 
visaal-ti^ s.^tistym;! a f -re-ie'. e rmined cri teri-CL . 

An igLfbl'Eidr^BP CLAIM is :ilS'^ inc:i.ii:i€0 fee >i mei.r.(jj of i nri ir- 1 1 mg the 
interactii'n ^.t an HLA molecaie to an antigen -oi:if a~is ing tdi-^ ind ri.;.st rat ion 
■^■f at least one -jomcjound -.'t f'^rmaia {I). 

PI, [■:: - jp-tunaiiy scr)sM.tat.ed phenyi, oenzyl jr jth.-'r S- i- 
^-memherei ^iCimat:-"' rir:g sy"te-:T!, opticTiaily stTita i.ni.r.c one or mc r~o 
het e r-'Oat'Sri.s seL-Csr-j:! fiom 'h, i and U 

F - H, (ptronaJly s ar:?- 1 1 c uted pnenyl, henzyi jr ddioi; aromatic 

ling syst^rn, 1-1'' ■■; aikyi , l-i'j C alkaxy, tiai-i^fen, Scig.^, a:nide, .or C'jOR 
M = H or aikyl 
PI = H .jr aikyl 

PS, R7, p;l = H, halogen (F, CI, Br, I), alkyl, l-lOC alkoxy^ 

amide, MOl , amine, I-IOC cy zl.j.:.! kyl , nitroso. Oil, et:hei, ester, saifonic 
acid, aikenyi or allyi 

Y - n .zr c 

ACTIVITY - In^:[iunosuppresive , 

MECHANI3M t)F ACTION - HLA-agonists ani antagonists - 

USE - C'jmpounds identified by ;A) , cr of formula ;Ii 'tan be used to 
-reat autoim.mune aiseases, .^rift versus host disease, transplant rejection 
and multiple scler'zsis. 

ADVAhTAGE - (A) ailc.ws adjusting of a compS'Und's structure tD 
■:f:-timize the fit between the nest molecule and homolagcus compound. Ihe 
rr.ethcd also allows for the modeling of host proteins whose 3-D 
structure is unlin'zvm. 

DESC? IFTIOn OF n'FAWIt]G(S) - The diagram shows a 3-D ms.del of the 
hLA-LiRZ''.)l molecule prc^iuced by homology modeim.?. 
[..g.i /iO 
FS ^"FI FFI 
FA AE; Gl; D'd^l 

MC zdl: r0-l-:d).l':2; B'.i^i-.:iV; BG4-Eil; B04-GO4; B04-G0^; BO'l-NiJ-; B0t>-D04; 

P0t;-D07 ; Bli)-AC?C; Bll-COt; El 4 -GO 2; B14-(30kC; B14-Gi:)z!:'; B 14-LOl ; 
B14-hO^:o- B14-..:C1; D05-H09; L'C5-H11 
Ft I: T';l- ^'::z&4B; 1 'Jl - J'l ^'Bl I ; T':-l-:0-.A; TY'1-JF;-C4 

TECH fpt:-: : iOCOOl l-i 

TECH1-]':T/:'C-Y FOCnC - 1 HArMACEUTIGALC- - Preferred Treatment: ':zmj:zunis of 
f'..rmuU (1) are further admin i st--i e i witn anztner active acen*: selected 
from. '";D4c-lidana antag^znist, szIuIvl^ C14 1, ant L - :yt .eine antirody or 
?inti-..-ytokine receptor antiooay. HFA-inhi bit i^.^n results in reauved 
cytokine, preferably IL-i, crzdu:-tizn. 

Preferred Cs-mp-zunir (I) is 2 - ( . 4 - ( a :et yl ^mi nc^ : phenyl ; az mz; -i:- (•:-(( (4- 
:acet y.aminzj p.^'^nyi ) amine) sul f '^nyl ) -4 - c-xc ( B-hyc rogu inaz zlin-3- 
yl ) ) u -etami de or zne of 15 dis-^losei analogues, espe-ijlly 

J- ( {1, 4 -iizhlorz phenyl) aminz ) -N- (7- ! ( {2, 4 -di chlor zphenyl ) amino) sulfcnyl) -1 



T:-:'"HN(.!LC(;"Y FOCUJ - [:■!■ ■TECHIK)L'"'GY - P r f e r re!.i Kr:eryy Function: 
Th- ^^.-r.ercy function ::-r (4) coiiiprises a Van aer Vl:iaii-: i nt eratjt ion 
^t':m oH'-i an e le^ :t res" ic int ^vract ic^n tc:rr[i. Mininra.ri.nq tne ^^-nt^rgy 
function '::':.>nLp.r i .-;.es p. .r^-binq th- ccanpr.und ' s ccn f (-.■rma t. i ona 1 

r i-xil-'i 1 ity . Tn^^ rriiriin-um ener-jy i <d glcbai mir.imum of the 
function. Modeling : B-dinien^-^iL^nal stiucture of the receptor 

-wr-QS cC'TTipr i se s prr'Vi'.una a s-*' thref;- '::0'jrd.i nat *:'S fc^r h-ech atom cf ^he 
h.jot rroleouie ( j e f ai: .ly HI.A- : >?.- 1 ? 0 1 i oefinin'; a \,osi^io:\ cL the center 
':'L tiio atom in <.i : -o i :r,en;-- io-na i leferentiai. 

:'.r r re^:: fr : '::e.-; 5 .i r.a Metr.o-:;: tr.e n^ .70 ir-"lecc.ie .1. s o protein v;ith 

^r.■ 'V/r. priir.ary and nd<. ry 1 C':-/.re3 , and cn.-.now:-: ^.-rtiary structure. 

I'r.*-' p':-5iti'Vr. th*- ■;--nt^-r f.d ^ac'n atom t :he hC'St i[iole'::ule 10 provided 

o:: ; aligning tn-' n'j:r: n.^-d -o-n 1 • ■ s 'i-'j :er\ce '.oLtn a homed o u.k 
protein ( r e : e r.-itd y Z?l 1 t--r:. iary .:tr^^cture .Lne-:! h'>- ar: 

i>ray ciructure cf il'? 1 C'jmp'i enea vjitn a:"i infiaonza peptide) 
. ")K';-n-::e ■I:'!: " -j i n-aJ f r'':r:- :i a:"'.-:.=-e < f proteins v;i^:h <nown 
r^_-r\.iary s t I'uc t^rr ec ; 

(hd .assigning a sequerice-homol'.. gy ::-c«:-re ro each hoiuc-l-jgc-ais protein 

, i n^i 1 ::at irig :h- p^-r c-nt.,ig^r lC aniLii';. aci-i;? -jccMpying identic.^l positions 

:,n- :.:-s:. jnd nL>m':d ^-g > ^ .;o ;.eg:;eK.>— , an',; .f : rni.i r:<J x t. •■■mtahiil e h-jot tertiary 
^tru-t/ure rg- ver i y i ng ^ne atv-n..? --d a t irciecvale h'^CKbcrie c^n the 
;:ca a::;- --riC ^ n'-'ir-c .i. an protein ^;ltn a a g ^izn^.:e-nc:mc:.I jcry ;?.cc:re 

■ .■f d ra'eae t e rna.ned value a: iii.'/h-r; and v/eriayi:\g :r.e ata'ir.s: c^ f a hC'St 
oi-.:e :h£iin v;itn an -:--:ia i '/a 1 ^: r.t ^d oe cnain in ^ne noira i.a g-aa.^ protein 

th^ r rH^sp''-.Tidi no jt'jms .in "h-- l-vMnca';) ji'Mio protein; and 
o-- ; r^^finin':! : t.-^ t-ttiary f r 1 1 -d r-: tem}:)_at':- r*"/ ad;e.5*:ir. 5 the c>03iticns of 

n ■::.!■)! 1' ^a protein t : i:a-\)vid-;- . atrartar-;' witn a l:o -iner^v/ 
Viluv ietinO':: t-y i nr r a i : na .h€':vj---en t.h-;- a^cm;: i.r. t n-r h:at uicdecule. 
F'^r^r:ing the r.^^idiid fit- t-rd.iriiy atr..ctjr'- r'r-terardy _:*-:-mr'r laea rMi^sitiC'nmg 
a temr'late •::C'nip<_ ar.d v;ith knc-v/n has r -j:d.n':iing proper:iea in rh*:: recept-ar 
aire, and addin^i a term desc'r mi ng interact i c^na bet:v;een the tentf^late 
:-.:'ripound and a ii'jst nclecule aide cnain r ■:■ :he int-i-rn^I energy 
function, 'i'ne netnco rurtner ■■'■-mpr 1 aes t'-atmg laentitiea 
c:.m[-a'und5 for ineir ardlity t.;. sf e :i f i :: .ii i 1 y rdnd t-:' HhA-oFdSOl and 
ae lectin a rhc^a- with th^ greatest bindina affinity. Analcnuea of the 
a^Iected comC'C'ar.^la ^le ■-■t't a a ne'i, and are teate-l by an in v i t rc> assay that 
meaautes binding to HLA-bP.l aC; i , an:i tnoae with tne greatest affinity are 
a-} -':d. ed f or i i"i viv^a usa.ie. 

F'rel erred F'.ecept v^r i^ite: In-.- r-ceptc-r site camprisea r.egati.veiy charged 
rvjj.tet and n hyvirc fjh : bi'?. C'Tcdi-t, ana tne template ::o:TiC'Oun J cDniprises the 
aiie chains :d resiaaes 1 ; -1 arii l^:.i of Myelin Basi::: Protein. Th.;' 
ai:i- Cham of resioae 1:4 ia lispcaed in the hydropfi :'r>i c r)Ocket and the 
aide ■■diam ''A. r.-sidae l^"i ia in the neg s »■ i 1 y charged fi'Ocket. 

L123 AddWEF' 3(1 '')F 7':; VvTdX ^"datVPMHT iOMJl [>EF;WE:iT INPJFlMATIt'N LTD 

AN _:Vr:-a:-:8::;54 ; 0 '-g WFdX 

DNN »')a-('d:r: 9 4 '-^ IN'? C. ' 0 1 a- . r 1 <r- 4 4 

TI d-jvei ::'tap)nyl .) .:occai nr.:'.; p ■ 1 y niic 1 e at idea ar.d polypeptides 

in-:efai for aareen:ng ::ar ar.r 1 iaic^t r ia 1 ::ompaunia. 

DC W-.-A fi] d: i Di.: aCt-i TOl 

IN BDAdK, M T 

TA dd']!!-'. S[;:Tid:di:iD v^z':\\?-:\ t-p? 
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ICVl A61K0 3 5-7 4 

I ^3 A*) 1 K'j 3 8 - -1 ; C07K0 16-00; C 1 2 N :i C) 1 - 0 J ; C l2::01 3 - 0 0 ; C 1 2 M J 1 2 - 3 1 ; 
CI2E'';21-(:^2; C1.20n01-nP; i iN';. 3-<:. 3 ; G06F017-30/ 
G06F159-00 
W' 9'.^']7147 A UPAB: 22003)118 

N( -VELTY - nrdG po Ly no* ' 1. i de (CI' LS-vLate'i from ;;.t aphylO':.-'jccus aureus and 
t:.'3 polypeptide (1) i ^. er.'^jde.s, i r.';-w. 

DF/:AILEL"' r.'ESCF lE-TIOii: - Tlv- i La 'ied nrdG pol ynuciec t ide (II) selected 

,..n i;;.... 1 at p- 1 ynucle jt i d'- t':\az has at l^i-ast ^^^'2^ identity to, and 
C'ver t:t^-/ ent ire length <: l, a nucl-ctioe se^juence e-noodir-Q me 17:. amino 
a^'id se'.fuence '21) jr--:n .in tn-: sp>: s i : i c at i :'n; 

(h; .^n .isolated j. '^1 t'n^acie': t i ^ie iripr is i nq a nasi es t i <:is- sequs-ncs- \-;hich 
has at least 9 2 ! ioentit'/ to- the : -3 S'asep.iir sequence (22) qiven m the 

{•:] an isolated ■:■ i ;^'nucies t i 'ie tnat h.;:S or i.r rhe 322 se':5uence; 

an iS'Vlated ]: sd \'n.acie'.: t i d-: c-i't a inal^le by scre^wiiru an apprs^priate 
I dt^rary under s*:rinqent hybri^ii zat i on c^snditi-sns with a probe having the 
2^' sequens'e, or its fiaqnent; 

(e) an isolated polynuclec tide encoding a itature polypeptide 
enp'ressed by the nrd'3 qene of 2. aureus; and 

(r) a pC'l ynuo 1 es't i de sec^u-i-nc-' <„-Li[p i em^ii tar y to ths' p"::)! y nucls-ot i des of 
(a ; tc < 7] , 

Ii;r'E2EN['EI]T CIAI[':2 are aiss* i.n.:luaed t:.r the fs'lls.winq: 

( 1 : An i S'l ^ at O'i polypeptide s-r i e :t ed r :in : 

(a; an iSi..'lated polypeptide '::.'Mrp'ri s inq an aiains- a^sid 
sequence navinq at least 25^ i^-^ontity ts 31; 

{T'l an iS'". latei polypeptide '-c irp'ri s mq 21; 

{■:•; an iS'.vlat^i polypeptide wni'S^i is 2' 1 ; 

(_; An arit^O'Sciy ar.t.iqenic ■.r 1 nm.un tsn-rc 1 f 1 c for triS' 
polypeptide of (1); 

(3; A methO'd for : n^^ treatiaent ^f an maividual in need: 

(a) of ennanced activity r.r: -xp-ress ion :■ f (2,'; 

it) to iriiiibit a'Stivity or eMf^ress ion 'id' the polypeptide of 

(1); 

ih) A process for aiaqnosi nq c^r p^r t-gnC'S i s a disease ^^r a 
suscept ibrlrty to a 'disease xn an inaividuai reiatea to expression s-r 
activity of (I), corr.prisinq: 

(a) deternininq the presence or ak'Sence of a mutation in (II) in the 
genome of the individual; and/or 

(b) analyzing for the presence c^r amount of exp^ressi:'n of (11) in a 
sample derived frcm the individual; 

(o) A metnod for screeninq t'S identify cs^mp'ounas that activate or 
that inhibit the function of the polypeptide of (1); 

(0) An agonist or an antaqi^nist of the activity or expression of the 
polypeptide of ( 1 ) ; 

(7) An e:>:p'ress i'jri system 'tC'mfjt i s inq a pol ynu : loot ide capable of 
producinq the polypeptide of (1) v;hen the evfC're ss ion system is 
present in a cc-mpiaticde htist cell; 

(8) A host cell comp'risinq the expressijri system of (7) or its 
m-.-mbrane exp>ressinq the polypeptide of (I); 

( ; A pu-'Oress for pr jducing the polypeptide of (1) 
C'jmprisinq culturinq the nost celi of (r) ur.iier ^^r^n :ii t ions sufficient for 
the pr'jduction of the polypeptide; 

(I'j) A process for p-r-jducing a hiost cell ■:■ :>mpr is inq tne expression 
system (7), ■:• jmp r isin^g t rans f orniin g or tran; footing a sell v;ith an 
exfjressiC'n system such that the n^jst 'Sell, under appropriate culture 
conditions, [T-r-jduces the polypeptide of (1); 

(11) A host cell pr-jduced oy the process ot (12) c^r its membrane 
exr-ressiiiq polypeptide of fl); 

(12) A ■::.onputer rea^datd-e medium having stjrei -sn it a member selected 
from, the grciup' consistin^g t'f: 

'a; a p)-jlynucleotide comprising the S2 sequence; 
[b] a pj'jl ypept ide C'jmpirising the SI sec^uense; 

(■::; a set of polynucleotide sequences v;herein at least one of the 
sequences is S2; 



(■i) a set of pol e|.'t id*.- sequences whe^reiri cit least one ot the 
5equen::es o-Maprises tfi^^- :*1 ::t-quence; 

fei a 'Ut.a ::-et r^-^f -r* -ser-nnq a ^ o Lynuc I.e-^t j.de -equence -oruprismg the. 
S2 secjuen-::-.-' ; 

U} ^1 dita ;:-et r-t rt'ser:*\inq a p:d ynuc ie- 1 j.de sequence encjdiriq a 
p-,>lyr^epr ide sequ-nc-.^ ::mpM-.nq ':he 1 sequence; 

a oolyriuclec '.. 2d*.' C'jinp' r n nc the s:: :.equ(;nce; 
Ei cedypeot ].dv; -■:.-:npris in j the 1 sequence; 

a set ' ( i:^ p'idynu-.deotide se(;.enc(^3 where at least cne Z'f the 
s^rq^ierices :-j:upr:ses t hv hJ secfjen-e; 

-M a ^-et ct r- ] -/r. ef.-". 3^-quen-;'er vnner- rit. leas*: one of the 
seqiences c^r-i] rises tn-;.- hi S'^iJ-rnse; ^ 

■ ki a -j'lr ^ set i-epr--sentin T :i i .. 1 -/nn s -Ce/*: -de sev-ence '..ornpr ls ..nq t^e 

h.' s^:<.juenc'';w 

■'hj a :la-^a set rer-.r-.--.erit .:. n j ri y : 1 ynu s .e- u. ;_ie sesuerice encndmcT a 
re l-/r^-j-)C ide s*:-:ii:-r;ns- r - . nipr s . n 'i *:ne :-l se^.ien-;e; anl 

'"\l3) A c...mputer hased me^nji tse: perfor-imj ho:Ti:-ogy hJenti ideation, 
(-■■mpr ising pr-jvsdmq p.-.d ynuc leo t i .:e sequen-je comprising the S2 sequence 
niv-Mi en tne spe^- f icat 1^ n m a -r^meuter rea-i..o...e nLe::-um, and c-smparing 
-lie pslynucle'jtide sequence to it l-rast coe p'.lynueleot i-ie ^i^r polypeptide 
sequH-n^:e to identity riL-m-j logy . 

/^ C TI V I T ^i' - A n t 1. a ■ t *. .- r r a 1 ; : : y r o s ^ a 1 1 - ; V . rl n e r a r y . 

[■lEdHAdlhl-: OF AhTl'.lI - Wa-:c\.fi':, 

■h>S - The poivnu ::e-::t ides und p;: lypef; ti ies may te empl-yed as 
rese/ircn reagents ana I'-a^eriaL rs^r r:,e dis::c-.^-ry ol t reatineists and 
dia'-rnostics for .diseases, pa rt i ;:ulaL ly human aiseases. Tney can be used 
for^iiagnosis of the ■disease and sta.:ing of i.sease, .md as reagents m 

airrerenti^i sci-en^ng rii- n-.-lo . iri^ j.. .^^ ..^ . _ - >^ - - 

f,^^ h-^rr : :iz .it 1 . n nr-bes, an.^ f:)r ssre-^ning o: genetic rrutations, 

.^or,-,--'po, :3-q L.>^-ni:_f K-dt'ic-n, ana f u or:janis::. ::hrec:c s erne j clent i f i sation . 
Tne p'lycectues san t . us-::, tc ::rs.:uce antLO..dLes. 'he p s] ynepn i Jes can 
^1.5. i",^ ,iced :n vaccin^b t ■:■ rrmul a 1 1 c^ns , and tc ident^ry agonists and 
anragonists. These are us^rd to prevent, inhioit or treat diseases, 
particularly of Helicobacter pylori infections, sucn as gastrointestinal 
carcinom.a. Diseases diagnosed, treated and prevented by the products 
incl'ide inl^ctions of the upoer anc lower respiratory tract, sardiac, 
qastrointestmai, CMS, eye, kidney and urinary tract, skin, bone and 
joint. The polypeptides can also be used to treat wounds and body implants 
to prevent infectit.n. 

ADVAhTAGE - The frequency of St aphylc-sc real infections has risen 
riramaMcally, and it is no lonqer c^i.mmon to isclatej S. aureus strain that 
ar^ resistant to standard ant ibict i-:s . The nr^iG proaucts or the invention 
can be used screen for new ant 1 r-ac^t erial comc.-.iunas t nat may target these 
resistant bacteria. 
Dwg . 0/0 
FS CEd EPI 
• riC|j 

MC CPT :^Br)4-C01G; P(J4-E03F; B04-E0t; &04-E0&; DCM-F10A3E; E04-F1100E; 
&ri4-G01; BO'l-i:03A; B12-K04A; E14-A0i; B14-H01; Bl^-JCU; 
P14-h01; Bi4-hOt; El^-IiOl; B14-N03; !'14-I]07; B14-N10; B14-N17B; 
Bi4--nB- ['i)5-A("aA4; B(>5-AC;1B; jOE.-Hm?; D0t-HC:8; D05-H09; 
n„;i^,-Hi:; ' [:05-HKA; [■05-HltL'l; DOS-Hl.'E; [iOe.-H14; D0S-H17AG; E'08-B09; 
E.'0 9-Ct:lB 
EPl : S:)3-Ei4H; Tt)l-JttB 

hl23 AhSViFB 31 t-F : WFIh t'MPYF.lohT ^-JJl PErd'iENT IhEt^RHATIOK LTD 

AN 1 '. ^ - 5 7 9 ' .1 3 ; 4 9 1 w [■ I 

DNN dl ;^^9-438 : 13 f-d''^ C 1 999- 1 l ^^67 

TI Ia*^r.ti tying pathways if :irua action. 
DC BC:4 rdi) S9 3 TOl 
IN FPIEND, S H; STCUCHTtTl, P 

/Ufn^T-?-]' ROSETTA. INF'HAPttATI'"5-' 1117 

CYC 3>:. 

PI US 3'^b5352 A 1-.^9^101. (199949}^ 47p i:i./jqJ 1 -U c 

WO '■i'"-^53708 Al 1J991118 (30000^) Ed ClCttOl-Ou 
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Oh E'T SLi SE SL SZ uG 
W: AE AL AM AT An A:: BA BB Bf^ BR BY CA CH CW CU c: DE DK EE ES EI GB 
(.;D GE Gii GM Hh HG IT- JL IN IS JB KE KG KB KR P'G LG LK LR LG LT LG 
LV MD MK MI) MW MX WO WZ BL BT h.O BG SE S( ; SI SK SL TG TM TB 

TT GA l^G GS Vtl YG ZA SW 
AG 99^B'>06 A ]'V^'ai.:9 G": B :j fi 1 i: IBor.' ! 1-00 

ADT GS 5i^^G.::^: A GS I --fj:-^- 7 -l oe 3 ! ''^ 0 0 r ; W( ■ ^ ^ ^ 'j 7 Cc AI WG 1 5 0 0- (/S 1 C 0 5 ' ■ 

1909ijM')7; AG O'f 3 8 ■ * ) A AG ! ' ) '.^ 9- ' ■"■ 9^' 6 l'.0^^'''Gw 
EGT AG 9';v:-bG<'^j A B.jSe^i 'Or: VJO G'^'/fr /i' -' 
BKAI GS 1 ■-;^!:: -7 4 9.^3 I'-'.^tOBOO 
IC I CM CI ._C"GJ 1-iJO 

BG"^ A^'Gr'-:'.-] "-'G: ; <"G .G :' J ; . -0 . ; GG'.G^l-fG ; ; :G G: :G - ; G06F017-00; 
G06F017-14 
AB GS ^■'"i^jBSB.; a UBAB: l'H--^91i^'l 

KC'VEBTY - M^^'thod3 lV-,r dv:. e.i ni n.. r.^; tr;e [■rin.^ry ^rd .■■eci)ndary biclc:;;ical 
}j3^hv;r:::y;? thiC'U'::h wKkGi irua;r i'-:"^ oeGL,- a::d laerr ifying the 
protexns aiiii <;je:ie.\-" vvhiiYi a: e .j i r eot >rd '/iu 'i^-oG. }; atdiivay , ar*:- new. 
Hethoas are also inoiud-rd for id^.vnt i f y i nr; ^Gri^i; pathway^ affec:ed B^y 
erivironirient al ■::harir;r;, di sease and tnos- : r.r <./ua;i wh.: ch: t:^e ^m. de-e f f ects of 
oruc^s ate niodiate-:! . 

E'ETAILEB [lESCB I F T I'.iN - A n^-th^.d (I) tor idont^fymo rho hGclLgical 
pa^.Yiv.'ays> invc-Ived i r: th- a-^.^ i ju ■.•i a druo -^^ o-;:l^, 'joitipr i :5 : 

' : cie"^ o riiiin : r;''r ■^h^ C'G.GaG-ir r -r'so-' .■n.-: p : ijLi'...::eG in the (.";-G 1 B' y 
difforenr L^-ve 1 s -> r */:-:p'--suro -oj t.ne araq G:>y niea aur ir.'j ioa effoct;:- on a 
naraboi jt co^luiai' :C'n3 1 iGu-r.*: /; ; 

Ui) r-rpL^seiil :A\q d mO'del >jf 'liru^ res.^r.ns^ as a oomBrina : 1 ct. of 
;:GoIC'g:oai p^atr^way roar-ono^s (nrjdarea ny -rioaourLno ^fie co l Laiar 
ci'ns t i t aent 0 O't a J^a r. B-.'^i-:: jG pa"';way i ri r-"-- r--;-r.se lo'> afferent Gevoia of 
riorturt* it ic n ^ in iBe coll, ''-ne -'^f v/nich .i - oi:):)---:!: t'^ an : nrior^ondont: 
a::aling t :^an-: f i-rruat icti ; and 

G-; id 1 vi-r.'-rm: M.n'j Boat. a^.^Iin;; t r ^na I jrrr.at i -jn. o: th^: B^i cd e-a i c aB 
fiitliv/ay resTi'Onses whi'Gi mln-miz- ir.o vilii-; .nn oidoctive: 
function oi the d^lforer:ce ijotv;oon the candiaare dru':^ rospon^e ana 
the moael drug reap, onso (Gno cC'iuBvLnat: i on oi ■: ho roap'jnaes aurjjoo-.t to the 
t^est scaling t rans f oriyia t ion id-niifies the fiiologi':aI f^athways involved in 
the acti'jn of the drug on the ceil typo). 

0)ther Il'IDEBElIBEG'T CLAIMS are incluaod fcr si^milar metnoas tor: 
(1) detor.^dning a in.jre pathway-specific dru^ .-arididate f rc^m an 
initial drug candidate; 

(B) identifying specific Biological patnwaya which are involved in 
the action of a drug and which raediate side-of foots of the drag; 

{'■■) identifyir;g a nambor of specific biological pathways that are 
involved in mediating therapieutic effica''jy for a disease or lisc.rdor; 

(4) measuring tne similarity of the effects of t\^o drugs on a cell 

type; 

(5) identifying biological piathways mvjlved m the effects of an 
envirc'nment a 1 change ur>:ri a cell tyc^e; and 

(6) ident i f yir.'j t'Loiogical p^athways inv-jlved in the effects ■.■f an 
environmental change upiC'n a cell tyt^e. 



i t-j ':harac*: eri ze the effects of drugs on 
oi-:d.'j-:| i.-:a 1 pattiways in cells affected by 
change and dlseas^;. Tnis information is 
ags and alternative tnerapies ani in 



iJSE - The me thesis in ay be use 
■cells, in particular tor finding 
<;lrug action and by envi i-onmenta 1 
useful m the ^lis a.-veiy of new di~ 

unders^ an'iing effic'a-cy, side effects, ^.o:-: ici t res and activation ol 

me t a b o -l ^ c r e s po'j n s e ;c . 

Dwg . C' / a 
ES GBI EBI 
EA AB; BCtI 

MC CBI: BG4-E01; Bll-A; &11-C0";E1; BIl-CO^EB; B12-K04A; 
B12-K04E; BOB-A'i]; L);.!t-H'Ji; D05-H09 
EBI: S03-EMH4; T0.-J04B1 

TECH UBTG: 1 9 99 II.: 4 

TECHIJGBOGY E3CGS - BIGTECHM'OLOG Y - Breferrei Method: In (1), step ;iii) 
comprises determining an actual minimized value cf the objective 
function. This is determine-d from the candidate drjg respcnse and 
the model drug response. Theref'tre, (1} further comprises a step :)f 



asS'Bosirig the stat^sLlLdl s iqii^ 1 iCarice ■■■t the best .-•.Jaiirij 
in -a nunibei' of pathways, "his c ok^^ ::: i s e : : 
fi) -r^ptaininq an expected | -roL'.iL' i iif.: y -j . s t r iout i c^n 



[riininiZ' d \^alu 



of 'd:: 1 e ::t ivL' function; <and 

fii) ar.r;e5S:.nq the st .-i- i s t i ::a i r i ^rii tic ^nce cf the . actual inininizt-d value 
ef ohjeotiv- function .: n view . i rh-- expected p-r. r->a t :. 1 it v 
aistribaticn ef minimal:. \'ilue:-- or > ; e-:-" i v^r function. 

ilie expected"! f-rcbability distriri:ti,-n '■.':\e mini:':ur:L vaijes of ...t-ective 
function L:c:;ipi ises : 

(1) ran-i:.m:: ::ir.j In- drn.:i i-^sp-nses ..t . f f^;- rent ^--v^-ix; or drug -LX[-03ure 
and ran:i:m:i ::ing tn^- m.i.riel Lirug lesp.-.rise by rand^.m i z . mq a numl.>er 

(li) ae: einiininu a -.h--'.'re:u. ea ^ :u:.n::'xnr --alie uf t- r.e ^■)rde:t:v^- 
function r^y finam'i b'^-st seal ing t rans r ■.■rm:; c : ;'ns -i numbei: o: 
the rari:.:Miu ze'ii yr^zlAJa-; r^-sp:n::e.s wnx-h minimize t ne -:v:e-:ti-- 
function ■:■! *: r.e r:irtereri;e i:-e-/'-en ^rie ;■ anuei; i ze r--:r:jnses .r^nd 
the r an':iC'mi zeo i;\C'd'jt r^-S]:. j:iS es ; -^riS 

(iii) repeating steps (id ana (ii) i. o :i-tet-it.ine ^ nuii^ter of thv-oretical 
minin.um values d.'hich for:r[ an xpe-::^:. e :i pr^jr ah :. .1 i.t y 1 1 s *: r il: n-*: : -an r 
minimized values) . 

The n^ethod (T) may lurther .rc-mf-r i se veii fyin-:: tn^t \ho bioi^.- uuzi 1 p:irhways 
are actually invoi-.t-d in .:Mctijn f t h-- oiii^ by , 1 e^: t i nc ^ mi'del 

res:::C'nse leirher trie tii::^: r .-er: ^rri :\.:--iol sp-sr:Se ; w:ii-:n tx;naV':'S mcst; 
similarly to a -cmbined urug p^- r n - rt .a :: i on respmsv. Tlie dru'j per t urr^at ion 
response is pr:.'-:d--d ry ::;-asurin:-T a nun;;>er ;d ■-■:u_.i ^ns - _ t uent s In a 
cell expose.! sin^ultanecus ly to Vc^rying lev.^ls :t cru;; expr.sure ad varying 
levels of perturn az r jns wiahi.n tdi-; rii ■: 1 .cj b.v i 1 pat hv; ays. Tr.e rd.rst and 
s-cor.d m'T'de 1 rest cnses cjitiprise a cvmr iriaz i ■: r: ■: f se-/-i .al p.^fcn^ny :. eiponses 
snbjec*.: to best scalini^ ans f orn^t i ons evah.;a::eu a^: a nunlj'.i' '.ri su:ris 
■^rrach sum is tn-v sum a numo-r la--ebv : :." eg '-xp':M...r-- suiyect to 

id.e scaling t rans format i:>ns an:: varym:: fo'--]o a^ >-t'i rr: a t : tis to :he 
pathways). The luolj^-ial pathways are \'eriliec as p:tthway;i; :n'o-l-/ed in 
the action of tne '..irug it the :irst mo::>jl re^-p:nse ^s isea. 
(1) may also cor:ip'rise assignin^g a c-rllular cv.-ns ti tu^nt ^jres-nt m the drug 
response to one or more of the Id ^o logical pathways ;the bioh.: g ica 1 pathway 
response of the i::onst i tuent subiect to its o-sl scaling t runs f.jrnuati on has 
the greatest correlation with the drug restjonse -...f ':ne celluiar 
constituent). The scaling transformations comprise ■ransformmj levels of 
drug exposure to corresponding levels of pe r t urba t i-^n zo r he biological 
patnways. Transtorming is done by linear mapf\ing. T::.e C'bj-'::tive 
function is minimized by selecting best sets ot t-'arameters ttar the 
scaling transformations. The ornective function cc-mprises a sum 
of the squares of differenres m drug resfjonses at varying i- v-^s of 
exposure to the drug and the model response at differing levels of 
exposure (further subject to scalinu trans format i .ans ) . Tne 'jLijective 
function comprises the negative of the 'jorrelati'in af the 'iiuq 
response and the model drug response. 

The biological pat:hway responses are p.uoducea by a met:nod comprisma 
interpolating the measured values (by af/pr-T-ximat i no a sum of sp^me 
functions or a Hill function) . The pathways r.-s+-,ed are 

cnC'Sen from a c<ampendium of tnose likely t^:- be invc Ived in dtaig responses. 
Tne cellular constituent measure:: is pu'efeiably FdJA, measur-1 oy 
':'jntai.:t ing a (jeri-".^ transciip't wi*:.n r!IA of -ine :el ! <:-r \vdd-:h stlJA ifarived 

Iram it. The transcript array C':>mpi:ises a sui f ac- wi^zh atta^/ned nucle'ic 
a.:ias or mimics wh^chVare capable ■: f hybr idi ln^: t<: the RtlA (or cCKA) . The 

levels in dr..g-tr e ate d ana unrreate:^ :olls i..; ;-:r:pa::ed. 
Protein levels may also be measure':: using .::n^ it-C'.iy arrays 

{ccmprismg a surface with attacheo ar:t ib-:;di-s wni:-h tdnd t: the 
proteins; or by ■! imens i on al lect ra-j- n : reels . 

The activities od tne cellular :• : ns 1 1 1 uent s may aiso l:>e mea^ojre.i. The 
c iol-:.gical pathways originate at a number if spe^-ifi: cellular 
zonst ituents unJer the C'lntrol if an excressicn systen.. Per tu roat izns are 
zausei l-y altering the excr-assizn ani ucti'/ity nst i tuent s (e.g. ty 
transfect Lzn with a hete ro 1 og-cus gene 'ar oy tne a-ticTi of ribozymes and 
antisense sequences). The exp-ressian z-f tne constituents may als:> te 
altered t-y the exp^ressicn ■: f the protein as a fusion 



protein r:Ginpr is ing the protein ai;'.i a tj^-giMn ■ ■.■ji i ^ 
ciecjra ies the protein^, c^r by the use of a r. f; i.hea : es to the 
protein c-ther arit .agor.i ^: to , diogo cind d ^-jriio. ir. t n^g-jtive i:;o"-irit 
proteins . 

F'referre':! Cells: The ':>:dls are substantially loojfVrolc to Gafjoh arornyces 
■:-e rev i s L . 
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TI r i-edi-.:;t ir.g the folde-Li st. cootore ot proteins. 

do E-bl bio :"'oi s;7 TOl 
::: penne? , d a 

oA ( bbNN- : i bbl-^iNbP. o A 

II dd t-t-^;b-l A 1:0-^1-1: (l^-^ol^)* il:^p ddld:::.-ud 

PFdVI dd br^d,:-8a7d2 4 b^' 97(03 3 7 
Id b7[^] d:ii:o3 :'3-:d:' 

bid G06F015-00 
Ab 71 3 7 7 S 7 d 4 A U F ' Ab : 1 9 d 1 1 

NtiVELTV - Predict ir.o - h^v f.": 

obiar.:r.g' seqoer.ces >: t r.-'or^: ^ 

evolutionarily c:C'::.: t: r ve 



^:'0 0 1 1'OL't ure 'jL proteins, by 
'J :s proteins .u.d ^o^ i ng }"atteios of 
1 v^riefd seaueooes r^j assign tj-js i t i-jn;: , 



L'bTAIbE:> I'Ed'Ibl PTbjb - This ir.ooood is u = ed for ^vredi rating the folded 
-Etruoruro 'jf proteins, :oMtooi l no aLioMiog t:\e .ro oueo-:j^='S ■:■ f 
b'jino.i.':-;':"jo proteins, using p^at terns of ocnservat ion ana sequenoe 
va:r' i at ^ c^r. om o: a clear 1 y 1 i r.eo evolutionary r el it i onsn ips . 

f'l'Id*:-':: ^- ro'-'" u re , .o:-':iV':- s ; t 'S , and }:^arsino sognients. decocdary ::tructural 
onits .■.;r':- ass L.gne^i hy i'l-nt ty l ng p^ei'i odi :: i t y in tne a.E si ^ninents , and 
asseirxbe'a inO'j gl^js^aiar io:.v. ^.,:ing d.i:rt3n.-e o^^n s t lai no s iniposed t-y 
disuJiide bridges, a::tiv':' -::.it- assignments and var iat i 'Sn analysis. 

An IMDEPEMDEMT <7LAIM is ais-:- inL'ludea fc-r ei method predicting the 
seo.:Ti d.-:ir y structur-:- proteins using the ato'/e methC'd o]:* to the 
{.■■„onL assigning s-'::'jn':ia r strjciucal a ruts. 

'JdE - Th»E' f'l'eLi i '::ted s e '.:<::> n ■::ia ry structures are useful fcir identifying 
antiaeni'!^ sites on a protein msleoule, as cuides fsr site 

aireote<a mutagenesis stu-li-rs, and tor unae rst an^li ng tne inoeraction of a 

protein with ■ j t h e r m-s 1 e o u I e s . 

AL'dAMTAGE - The meths-J is m^^re effi:ient than prisir art methods of 

ri erdi :\ i :'ig fs.lded strucoure -sf proteins. 

F'wg . t / d>: 
FS bp I EPT 
FA Ab; Poll 

MC bFd : B04-N04; bll-db:E; Pld-K:^4E; b07-HC-; dd4-3:i 

bPl: da.:^-E14; TCI- I 
TECH UPT>! : 1 9 1 i ^d 

l"ECHb7ib'9 7Y F7'7"7S - E blTE'IHIajbtdlY - Essentia] : e-E;t u res : Essential features 

of th- methoa are: 

(i) e:-:j:nining ali.jne:! se^jueno-s of several h':m[ d I'Sli jus proteins 
ratn-i ".dan a sin--jb:- se :{oerL :.>"; 'jf a single protein; 

fid) '->:t r a ::t i rig i n f rma t i -jr. f^jnce r n i n g the tnree iiriiens ion al structure of 
t n-:: protein farrdly t r^jm patt-^rns jf conservation and variation 
v-dthi.r. set jf n:coIt;;fsus :'-o:ioo:.cos , rath-?:- than oy a simple aver-agirig 'jf 
a p)r'jpi^rt y oi tne s^i'-^uerrr.-es tiKer: i n:ii v I'lual 1 '^O* 

fiii) ■^■:>mbini n-:j a i r i thms that assign ci-jsitions in tne alignment to the 
surfac- or interi-u" '-.f tr.e f:d.de:l structure, and ts t 

first step f^ar pr-'-di c t in-:i se"::)naary ^:oru:"tural eienior: 

[iv) I'.ient i f ying s-iparate se^jDniary structural elemer 
using parsing alg^iu'Ithms that identify gaps in the alignment and specific 
parsiro.i sequence Siotrfs; 

IV) the algorithms are appliei ts subgroups sf proteins with 
clearly identified evolutionary rei a t i onships , in particular a 
clearly specified sequence identity and evolutionary distance; 



ne active site, as a 
ts m the alignmient, 



Zhou - 



(vi) th^ aiqorithms are designeo to reflec^: nov; natural snecticn and 
n-utral drir:. influence the :iiverg-nt evolution of 

protein s-quences; and 

(vii) a:v3--rnbiing the 3^-:-ondary structural elements to forru super secondary 
and tertiary structuiai models i: y rient in.:j these elements using disulfide 
biid(jes, ..ictive site as -iquiaen- s an-:i co- va ri itu. on analysis. 
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AN r^39-4^.)^4 :o [33] WF i:-: 
ON!] Nl 9':i'^-3'] t h^d L'l.C 1 -bi- I ; ^17 ■•■ 

TT M-tho-T desi(?nin:: a r vi f i.:sia 1 protein - useful f-r nroducmci 

proteins Vvish requ^i^-i functions. 
DC R'; 1 F^l h TOl 

PA (rIKA) r-Ir'AdAP:d KEluli'UoiiO 

CYd 

pr j: iMuAs;^- A i-^9<t 'i: : : ^:)'V-) :'-^) - lOr) (:od<()M-i)^^ <-- 

ACjT JP i:i9ji:t/ a JP I'.e^o-:--: :'::31 r-)-:^:;ltOu 
PAAI JP 1 -■-^7-i7.M31 1 iM^iOiM-; 
TC I'.M C07K014-00 

I'-S O07?0ti-00; C07K014-805; i iriO 1 5-0 9 ; G06F017-30 

AB JP 1119 3197 A UPAB: 19'.-''n00 4 

A method hir pr-idu :ir:g ..ri ideal ;im:no ac:id sequence (>)mprises rhe 

ll.-)win;] steps: f 1 , t t""-!-'-^^"'^^ ^'^^^ initial ammo icid sequence 

c.--rresu...n Jmg to tne structure if . protein, (2.. st^lecting the 
ammo acio residues i'^t:mai t-) -i:n site; {:) >oipe.t,mq the steps or 
selectm.,^ the amino -.cid residues optimum to eacn sites or tne tota^ ammo 
a. -id residues constituting the h-order am.no acij sequence ( N is an 

T . -(- 1- ■• r - M J- -.M-.-i^ r =111 -inn a "id S(='0]en''e 

^ - - -f ^v.p V::..- r .-^^ "iirn---: i"id residr-e? until tne !^~orier ammo 

a-id sequence C'jmes t^e saire oS the M + 1-oriei amino acij sequente, and 
(4) s^-i--ting the r-sultmt I'-^uoj-rr am.n eiclti se-:Kien .:e is the ic^timum 
airin.-> -^'-i.-j sequence ■: f said protein. 

Als'-^ claimed is a protein having an amint* aci J sequence 
.:.t::tained by the above method. 

ADVAtiTAGE - The method can design a protein with aesired 
functions and properties, 
:.wg.0/5 
F'3 CPI EE I 
FA AB 

Mi: CPI: EC;4-C01; Bll-CO:~^- D05-H09 
EPI : T01-CU5B; TOl-Jl CC4 

LI 23 A:]SWEP 34 OE 73 WPl/. COFYPIGiiT 2001 DEPWEllT INECPMATIOK LTD 

Atl 1999-4 58333 [38) WPIX 

r^';N t)1^99-?42;:;3B e:;C Cl 999-1345(:4 

Tl Use of independent assay controls tc perform ana monitor assays, 

i r t i c u 1 a r i y irrimu n cj a s s rn y s . 
DC F.r)4 Ed f. Jh4 S03 TOl 

AdDEPBEPC, J M; Bl.^ECHLEP, K F; XCPhERf.ON, 1 H 

;bioc-i]) biosite piaci;octics inc 
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Y-o QQ"-' r, »■,(-■ Al l^i^^^O''i^ 1^-^'"^ *3&) ^ Ei: ]4er. C^C:*' FOl '9-00 • 

PWrAT^'bE CH CV L'E PK EA EC EI EP GE GH GK G?. IE IT P'E LS LJ MC MW NL 
'tp ^^T CP SE ^'C C^t CW 
W: AL AM AT AC AC PA BP i-'G LP 3Y CA GH ■■ CP GC F'E V-V. EE EC ^I GB GD 
:-;E GH GM HP Htf IC' IP id iC JF EE l.G CP EF. ?C; LG LK LE LS LT LP LV 
Ml' MG I4K mi; MW ME l]f> h': IE PT KG' lU CP SE CG CI C?- SL TC TM TF. TT 
itA UG US JP Vn VP CV; 
AG 9'^CIO*:.^--. A I'-^^uCst. ,19'.^9P:. ; ^ GG.3EO r:^- iOj 

-O^^^qts Al _(.':':I1._: ' 2 ^ ' ■ Ed GUuEDlC-ot 

F- BE ^-^H r-V - E ['K EC E: EF. GB GR IE IT LI LJ MG IIL PT CE 

r.-.rr :,r.'>c-y -: ium.-uo.;.- iMV.r,i04; AJ 9911C66 A AJ I9;^9-;;i066 19990104 

-04^.' Al EP I ^ v,-9^'il 145 l:-9tlj4, WO r)99-JSC^l 1591^0104 
EjT AJ ■V.CIO..0 A Based ^n Ivj 993o60:; EP 104 6123 Al Based on WO 9935d02 
PFAI 1^98-3065 1^980 UP 

IG :GM G06F019-00 
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ICS G01N033-S i 

;;:;;,EL?r-"Se^.f'S.ie;!nd'^ cntrm,. (lAC.) f.r rronitorxnq and 

s.-iu.'" Jnr.:...h an .ss.y device ■ccmpru..ng a react. :,n chamber (.C) and 
at- lea;-: one diagnc-lie Une ([L.) c.nprises , „,,pp) , 

and a sec-rni MHF, l.-..ur,d to a s:iia p:ia:=e, i.. -i. i--- 

to asi-.:y reagents in the .,f s^y^al, :-rcra th 

i-.'tern-iriLrnj "he rate o t. tl...-J t-cm tr.e i j- ^ 

^d'Er-EMLElVT C:.;a!.-3 ar-:. a]r . inc.ucei ror the Llowin., : 



e:n-xronxenta. ocndrtr.n. ve.rfyxng 

diia 'J^-'j ± -^^ r. .j^oc.y - — -I / 

J:.;::atLC-n jf iere'T*:. ion zc r.e; 

:;tt;od; fo^det;;ranrn, a corrected assay r.sult rrom a :nea.urea 
result dr-.l one c r :l:or^^ 

(.J) :aC results; an.i . . , i . . ^ , e .n--.lved in k:j . 

(e) computer proqrans ^ahA^^iv^.u-a -^-o--^— - 

A' ■ '1 1 'v' I T - Nor:-:: ;i i or:. 

-.op . t;,,^ rs-o t-r. rKri.tor or::l {.ertc-riii :.]r:r.u:rja 3:. y , ya_..c... r., 

■-■^ : . ..o-c, (::-:t:orrfi:oi rate 3f rlow, 

ccrrecr ro-u-ts; o-.L^.-. ^i-^ : ; ^ -T ■ . " . ^ -.^ n o.t i.-.n -t the assay, 



envrronraenta. cor.dition. pi.; . ;^ fv" l..:-ati.:.n ot detection 

A[)VA:jTA'^£ - Tne use 'jf TA-: ensure; 



- -at assay : -r^^iu it o 



.ven ur.en r^.^ assay sonoi^^Lrns vary ^^^i'Te^ ■S'^^^ic le .latrix 

■.nd pro:;ie.^s of th-:- i™->un.::.ac..y re.-.o:., .0 ^-i.-. - ■ 

h -s s C' c u r r e ■ j v/ i t hi a t e .s i o . 
L;wq .0/18 
FS CFI EPl 

FA AB; DC!] _^ ^ ^ .^^.nj.. T>^o-KC\i^^ B12-K04E; 

MC CPI: B04-C;Jit; ba-i-e>'i; c i x ■ ■ — ^-^^ ^ ■ 
D05-H09; J04-B01 
EPI : SO?-E14H4; TOL-J 

T-H.OLOGY POCUr-'n.STPnHEnTATIO. .U. TESTIUG Pr- f er re ^ jr.at.r ials : MBP, 
and tne assay rea-gents, are binding proteins, an.. ....i^s o.r 

.aectrical and/or magnetic signal, t let ei i .-d ^ i . . eo . 

. ^rnHit^r.rs t'"-rh MB? (:ne ..aoeied, the other Vv.tn an 

-nvirC'nmentdl oonaii. ^'-no , l-'-h ^.i-^r ^ ^n 

affinity tag) are rorr.mned ,y t^-y y- 0 -y'-;-^ J^;^.^,.^ ^^.^ to 
affinity tag r^i^tner (atp, . ^c.nv . .j^.^^^^^^^^^^^^^ ^..^ ^^^^^^^^ 

the siqiiai detecte.-.-! in b.,. .--.r,..i. ......... -^^ , r. vires :r./t ime to 

base of F.C and atp is inurK^bi . 1 .-.d .n y ■ /y -y t; y ^ ; 1 , ^, p,- ^^.^ 
co:npleti.on, a label , pre rerab y _ ii Ke- - , .-^ ■ ^ -^^^^^ .^^^ 

signal frorr, the laoel ^ 0 ■^'^'-'^'^ . :y.y,^: O 'y - e t e of cnana.-. To 
b.sea on me absolute va.ue of ^ V f ^ 1. „ , S y .Id, an. an ass.y 
d,.te.:t a -ieviant .ssay resul , . y^ ^^^/y ' ;"ty ^ .t,,y , ,eparat.e0.ones 
result iK^) anl ar, lAC si-gnal ore ^"^t.. te.n ra t a t ^ . J^^^ 

bH-ween each lAC and a mean value of I.-^b i.. d.,t,im n a a a 
fy each vaiae, multiplied by a constant (neta and :''y y ' 

..-.a.-.-h-- Tr..= t:.tal ic .sutat ra -t e d fr::r.i Fm, oi. . ..,u... - 1 ..nt ret,>.,r 
a..uh. ....g--n . - ^ .le-eriTiinei ov a rati-x 

and this r.:tai i:; ..i.itermxii.= - .- ^ - - ^ , . . 

function or cy luvaar re:,ression fr:m_a -.y—yyy.y S:,, f Terences 
tetv;een assay res.iits yy yyy^ Tzir .^ecna- vllyl i a ) a function 
between lA.? and ipean value f^r .At. .A., ^ecna . - , / petvje-n Tm and 
f is determined for each lA^: ann a qu-;tient oal.^.ated bet.je.n 



TECH 
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X + f ^ .jr ft ) a difference between each lAC ami a mean value of lAC is 
dett-r'i ined, multiplied oy a constant, the su:l of 1+ e-ery j 

pr."-du-t calculated, and a quotien:: between Tin and thi.-; sum letermmed. The 
j I AC values are particularly signals measured in th^. new processes. 
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Tl Predict jn-iT presence of atinorn.al levels -'..'t protein m ol': od 
c 1 o 1 1 1 r n J -z a s c a d':,^ . 

DC Bi"'4 ;:;r::: toi 

III EFAUn, F; FI3CHEF, T J; dl\TNS, T B 
P7\ (A^LrOJ) Ar'2'J MODE.. FO 

CYC 2 4 , , 

we 9--rM--r).-: Al 13:-9070o ■1999:0;:^ EN G01N'-i3-49 

P.W: AT BE CH rv i.E FF EC Fl Fr CB GF IE IT LF HC FF PT EE 
W: AU CA JP KR US 
All ^'9L9mj3 a 1j99071'^ (I'^F^^Fl, GO IIF: : c-4 'ir^ 

EP 1042^*V) Al CuOOlCll (2(iO0Fj; EM G0in-:O-4 9 

P- AT BE CH CY DE FK ES FI FF GB GF IE IT LI LU ['C HF PT SE 

.rrp M-- <vv-' 4 A1 WO 1 ■v:*R-'isj78.)^, l'^'-i;n230; AO 9019503 A AF 19':^O-19003 

";9eio^a;'EP 10u2o.;> Al F? l'^08--64 34 F 19931230, W'9 r;Y ) ^ - UFO 7 R b!5 1 9981230 

EFT AO 99rr'303 A Based -n v;0 9'-t34 2C-:; FP 1042')b^i Al Baseu on WO 9934208 

p^jxj [] c ^ 0 ■ t ',' - 1 i"j 4 7 1 'V'^ 7 1 2 3 1 

IC I'3M G0i!]033-49 

I^CB G01F0 33-8 6; G06F019-00 

AB W- 9934208 A FPAD: 19990^v:i2 

N'jVELTY - Predictin'^j the presence t an annormal ie-.--.i. of :.t ±east ± 
protein m the clottinq cascaie irom least one - . r:.e-d'rp--naent 
measurement profile inv'-d_ve3 using a mcdei mat repi>:-s-nrs the 
relationship^ t^etween ohe aonormai level of tiie protein ifi tihc 



clottin.j cascade and a set of pre^ic-:r variables 

DETAILED DESCRIPTI'ON - Predict in'_:t the presence or an abnormal level 
of at least 1 protein m the clotting cascade from ar least one 
t im-^-d^P'^ndent measurement profile comprises: 

(1) performing at least one time-dependent measurement on an unknown 
sample and measuring a respective property over time ro derive a 
time-dependent measurement profile; 

(2) defining a set of predictor variables which define one data ot 
the t im.^^-dependent measurement profile; 

(3) deriving a model rhat represents the relat iL^nship between the 
abnormal level of the protein in the clotting casca;3e and the 

set of predictor variables and 

(4) using the model of step (c) to predict the existence of the 
abnormal level of the protein in the clotting cascade and to 
predict which protein or proteins in the clotting 

cascaoe are at an abnorrrial level. 

Th*- [-.rediction of the protein or proteins at ari 
abnormal level is a better prediction tnan an abnorr.al clot time alone. 

An IF'DEPENDEFT CLAIM is alsc included fjr pres^.-ntjng a relationship 
b-tween data from an assay relating to throir.oosis-h-nost asis on an unknown 
sample and data from assays relating t t hrombosis-nemos tasis from known 
sample populations which comF^rises: 

(A) providm.T data from at least one time depenuent measurement 
profile fcr each r known blc.od sam.p.les; 

(B) perform.mg at least one r ime- Yepenaent measurement ■: n an unknown 
blood 'sample and measuring a respective property over time to derive at 
least one time-dependent measurement profile for the urn:nown 

fc lood sar.'ple; , . ^ ^ ^ , , 

r -■ ^ ^ ~ 4- - ,.+-,~..^, rM +-,'. o^f^d^'^^- " "ariatles which 

capture the information content of both the unknown blood sample 
t ime-depjendent m.easurement profile and the i:ncwn tlijo sample 
time-dependent m.easurement profiles and 

(D) presenting the data from the unknown blood sample time-dependent 
measurement profile relative to the iata frcm the known blood 
sample time-dependent measurement profiles. 
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USE - Used f'jr estimating trie C':>nc^ET:t ra t ion oi at least 'jne^ 

protein in the cli'ttLng ■::asca:le. 

[iwg . LI / J 
FS CFI E?l 
FA AF; i:>CM 

MC CL^I: B'i4-B04D.T; B04-N04; BlI-::07; Bl.i-Fj4 
E t I : o 5 - E 1 4 H ; 3 -J -El 4 H 1 ; T ' ) 1 - ■ T 

T E C H U VX 7. : ld9 l^Z- 

TECHMt)L(7(3Y Ft't'Uo - B IOTElIHN'JL'J'J Y - Preferred methed: In cne pteiictien ot 
tl"ie protein at an ^t^i-j rni..i.I l*:;ve.. , th-:- ]r>ec L f ic i t;^' is at ieasr 
7'. "5 a:":d ^' h-- .:--ri;:- i '. i :.t y is gre^-i^.er tnar. 0.6. The samples w is; n ^ ineasured 
:-';n'::ensiati':-n ot ti^^n ■■ J ^: ^:)f: normal t'jr a s}:'e::ific fai:::.!; r ire d^rfined 

a- ir^drig ^rt ar. ak^ri'-^riaal bE-'z-l. 

Th'i- f: ime-oieo'endent n.»:^ ^sar-eiri-nt profile is as leas: one S'psi':;.::I 
profile is prr-TJiC-zd r y /in a;:t:.:[ia:.ed analyzer f:r t nr :iTir^^-: s i.s .^nd 
neiiiC-st asis restin-:;. A nuKiijL i irciiiai laea suroiu'- nss ar :-ne r more 

wavelengths are taken over time zo aerive as least cne 'i-ozical 
profile. The L'ptiiral neasi^rement s rresj: c nd t :■ cYi.:jir-^Q3 in l.igh*: 
ssatterin<j and/ i-r light ah sc^rp'^.iC'n in tne unknown sam.f.'le. The o-rtical 
measurements are taken over tirtitj to doi ive at F::sst Lr.e -p^siio^l 
profile, and tne ■:■l^ti^^al me-jsu ren.ent s ai'e 'i-ach nc-i-maFized zo a 
f irst '.:pti^ai measar er:iers:. . As. Lv;;3:ss 'jne :f'*:iCiil profile is 

p-rcjuo-a aut L^niat i L-.j 1 1 y k^y ar. Sin.ilyzer. I'ne snkn'jwn samrde i.= aa: c-mat i cal i y 
r.-LTTujve':! by -h^' aat .:.rr.e " ed r-r^F*:^ ^.rs-in a sainr F^ C'r-r.^ a i nt= r r ■:■ a t-r---:- wt;;? I i . ijne 
or mO'T^ r-agents me aat i-mat loa^ 1 y adcisci tc the -est v;e 1 1 sc^ iniriar*:- she 
r-r'jp'erry ohanges v/ithm the sampde. The develc-r'iuen t ■:■( the p.: ':r^--^rt'/ ever 
1 1 ip.e i s au t orca t i o .ei 1 1 y o-r-t i 1 1 y ri'S-n i t r ed . 

A preaioted oc'nqenital oi aoquirea imcaianoe : her ^p-eut i c -:-':-ndi 1 1 ct: is 
ar;t c-ma*: r c.";i 1 1 y stca-d :s": .3 n^-moty if tne 'n'.^t .:.ma c d anal'^^'zer' ai;a'-:r 
displayed on: the Eiut'inatea E^nilyzer. Assays f'l-r oC'Sflrming ths^ 'rxist'E-nze 
ot the izoi'genital <.-i asqaii-d liaka ian-: e or s he r ap^eat i zndl i .^n ar- 

ij -H f-.TT- ;2. +- i,2r-i I ly ef f'^''*.:^d. A S'''t 'S ciit ^ fr-zrr. ^ncwn s iiT'.r le:E" is t^r E'0*.ooed wn.ioh 
is used f'sr 'i^rivi.rig tne m':siel. The 'data f r 'Om knc'wn s.^imrdes is cli-tained by 
fj ' r f L' r 1 u i n g as s a y s ri t h e k n w n s 5 irp* 4 e s . 

Tsnie dep^eniient measurement profiles inosFacie at least row-:- 
profiles f ro'iri assays initiased v.dth FT reags-nss, APIT leager-:?, 
frh'rinogen reiagenr.s and TT reagents. 

In (1), steps (3) and (4) ■r-:-mpu: i se transforming a set of input p-arameters 

fri:ni the t iiiLe-deperident Kieasur en.ent profiles f':^r the- k.ri'jwn bJ 'i-jd 

sampdes ana the unknown blood sampde, tS' O'jrresr-i'ndin^j individual ii'Utput 

neurons whose Ic'cat ion c^n an ii utr'Ut map cc^r r espcTids zo the respective 

input data. Step' (4) als-:' comprises : 

(1) selecting weight vectors; 

( 2 ) select! ng a s anrd e f r on: a t r a i :\ i ng set ; 

( ?d identifying best matching winning neur-in at a pa r t i 'E'u i ai- time; 
(4) up'dating weigh: vect-zrs and 

(t) rer.oating step'S (l)-{4) un^il the iT.ap' reaznes ecr . i 1 ibrian. . 
In step' (.:'), -riata frO'm tne t ini'E' - le^'ender^t n-.ea z urement profiles 
is trans fZ'rmea into p'rc'di ct z-r viriaoles that charazteiize timing, rate and 
magnitude of >:hanges during the t irr.e - :i'Zpenaent :ueasur-;-ment profile 
and< the predictor V:irLar)Les ar-;:^ used as inp-ut fz-r- near 5:1 netv/z-rks. The 
d-d:dnition of the predictz-r vard^bles is i pios i : i 'Zn iri a se 1 f - zu g.an i z.i ng 
f-atur-z map', trained with data f rzim the t ime- liec'enden t measurement 
profiles fcr the knov/n bbzoJ s.rsuples. 

Preferred materials: The proteins z-zmc-rise faztz-rs IT, V, VI : , 
VIII, IX, X, XI anJ/zr XII. The er.z-wn blzzcl samples ar.:i tne ..n^nz^wn r-ljz*i 
sample are samples of wh'Zle bl'jzd, f'lasma, zir zther part of wnzie rzlz-zi. 
The knz)v;n i:dz>od samples are samrlez zf v;ni.zh are inf zr-mat 1 or. is -cnzwn 
relatin;; to zr.e or mz)re intrinziz zr ^'xrrir.siz sl-zttir.g fazt:)rs in-:i/zr 
t Ti'^r ar)'^u t i az'*^;nts. Tne .-ir. zv/n blzc)l sam.cdez are sa.mLcdes zf "'.zzi-zt. arz' knz'wn 
tne pi-eserice z>r abserize zf z^ne ■: r mzre aonzmma lit i es relatir::i tz at least 
one zf fit^rinogen level, zral :r.t i z z a au ] ant , neparir., ami zn-- zr m::r-z 
fact z r levels . 

In step (4) one oi" more of normtal saniple, presence cd neparin, and o.ne zr 
m.zre fa^ztzr ief ici eri z ies are presented zn a PT m.ap, zr at l-ezast z^ne of 
normal specimen, presence of hep^arin, ar-normal fibrinzgen, zral 



AFTT iiia}^. The predi':-tor variaoles are in terms of a standard deviation 
fi"':'ni a i^ean c^f at least ^'ne Kn':ojn h>Lo:'d sainp'ie popijl a t ic-n , .invi the un<nov;n 
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L^d sanip'l^^ is char.a'::t^r i z- .-d by va r.i. ^ t ic^n from zhe mean of the knc'wr. 
o-i s<ji:iples for each predjorr;,:- vari^bi-. 
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DNN n l':-* ^'J^- ?■ b"' '-I odb 9 9'.'- bb . 8 5*^: 

TI C.n b:u] :.na relative st^^jl.d 1 1 1 i-..: o: f.vc im:ieeule: fr^im 0!?n f orina ^ isnal. free 
ener-;! i es . 

DC 99/ 9.::^ jit 9 .o^ ;-b toi 

IN ?' :'Lti9.;"VAr;:b i 

PA bJYtb'j n!:i7 CC^LbMPIA I^JEW YtK?: 
CYC 2: 

PI .VM 722: Al : ■-9'*';'-b': ^ ( b^^ "^'-^ ■ 5 i ' EM c ?t) G-b5Ftl7-ln o- 

PW: A? tb CH CY bb lb bC Fl FP i-b 9b I b I T bb I]C bb PT bb 
W: Ai: CA JP Mb Ub 
A Y 9 r' 9 7 7 9 A " 9 ' -> ' -1 . '■ ( ■ 9 9 'r^ t ] 

lb: 6178294 Bl .^('Ol^niio {^oOl^.J] G'b^F': 1 9"9:: <-- 

APT W'j ■.b^l72^2 Al W(j b:^99-bb2ij 
b-i 9 'b'C P ; bC 1 7 8 .9:^ 4 hi 'A? 

F Ei T A ' I 9 ■■- '? 7 7 ■> A based - :■ r i Vl^~' 9 

PRAI tt: 19'^7-'^.p'il4 ': b-:i7:v:i29 

IC nbb G06F017-16; G06F019-C0 

AB \-iO 9 31 7222 A bPAB: 19Y9C998 

l/jVbPTY - Trie reL^^iv^- starilities of t\^o incd eoai>-s , bavin:: d':^fined 
f.^nnect 1 vit les ari'.x exist -na in -Yef.inea envi riinmem. s , are i:;etermined iiy 
o.imprurer analysis C'f i"i-^"..u r err-, sent i n l'jv/-^:;r:era'^' minimum o-in f i r mat ic^ns to 
ij-ner.-jre .-^ t.i'tal r^^n ii o rat i n ir;-:ei:ir il (TCI) .-nvi trie.:- tn^i- eon f 'i:rtrat i iiial 
iree ener^Ty (CE^b) fir irttn mtfe-ilts, th-rn ■tomparinq the t'Fbs. 

L'bTAbbbb n^bbCrd rt^bib - Th- initial data set is oeriveo from moleialar 
'.^■"■nneot 1 vit y ^ a pitentia^ erier'.ry function ar.d a e-'.i": f -jj: ]iL:it i ma L 
s^jiiroh method. A ■ii^n l i qurat le n rnteqral, in ail -.i-."^re-'S of freedoni, is 
r-alculat ed h^ased on the oc^nt r it'Ut ii^n cf each moleoalar oonf 'irmat ion by 
P'or for rung imp'Oi t ince- samp'l ing, mul t iai men si ona 1 Monte Caibi {MC) 
iritegration over u multidimensional vcdume (aepierjuent on naim:nic 
vit'rat ic nal f requen'Si^-s of the ■conformation and t enip-e mature ) encoding a 
current con t 'trmat i iTi -i-f the m.olecule in ton f ormat i iin 3 I sp^ace . 'Ihis pr ocess 
IS repeated for every con f 'trmat ion in the data set an^l the contrir'Uti'tns 
surrined to generate TCI . 

The im;:'Ortance sampling involves pre t er^^nt ial ly sanp^ling regi-jns of 
the volume naving oommant 'tint ribut iins to the t^FE tf the conformati'tn 
and uses a r-hysical bacbgri-und C'f normal modes of virvration, with sampling 
P'orformed using an a^'tmic co'trdinate transf'irm basec -tn eigenvecttrs -if 
tr.e Hessian matrix (Hi) asS'i-c i j t e^i v;ith the p-articular :tonf t rrrat ic-n . 
lUfiEPElIbEMT ::bA:MC are alS'-> iriclu-ied for th*L- fidlovving: 
(I) siirular meth'tc fc-r analyzirig molecular C'jmjt) 1 exes ti- determine 
r-bative Dindiiig affinities of nc'letules btr a given r:i>st mt'lecule; and 
(.:) ap'P'aratus f'lr the aria lysis. 

bCE - The method is used to select drug c an u L :i t es . 
A[)VA:iTAtjE - The methtrc sh':.'u]d i'ientify -gents V'vitn greater- 
specificity ft-r, ar.i a:-tLvity at, t.jrget elites. It ii^es not require 
vxr.ierisive free en-ugy s imubitions ot c tmpu t a t ion a 1 ' a 1 sherTr>^ ' , 
fiwg . P/ .1 
FS 'b'l EPI 
FA AP; DCb 

MC 'Tpl: A99-A; 394-E91; B94-bCu; Bi:-b98; B12-K04; D05-H09 
EPI: T9:-E04; T91-994C 

TECH bPT)!: 199 90 992 

I'ECHMOLOaY FO:UC - 9b3A!liC tHEMISTru - Preferred metnci: ine lov;-energy 
ramimum s j[.f :)rmat ioris ire 13-35 kJ/mt'le ar)ive the lt)Vie3t energy 

"'n f 'jrmat L-tn 5in:i the potential eriei'gy function is bated 'Ori 
ri'jlecuLar meohariiss or on ab init.itj, semi -emp i r i t al ir density- 
functional quantum mev:hanics. The attmic :ti>- j rdir: ates use:i are 
external ■: Cartes iari ) or internal (bond lengths, bonl tr torsional angles) 
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arri Hi is exact or an app rox urat "^r: . Th-- pre :ie- e rmmed envi rcTiment is 
presence of solvent, vacuun or a '.hird :o.'..ecile, and rhe nol^- cuies may be 
subjec---d ro -xti lh;:.:: env:: r ■ -nmen ^. ai f'a-'O:. Tlie molecules b^-inq ..inalyzed 
can be^.:y:'I.c -a- acyr-lic, p^a rt lea 1 ar 1 y v/..::er, b i :<p :d yrner s , b:. ol LQoruers , 
proteins, encynes, peptides, nucl-ic .vo. i, ^ 
carb'T.hydrares, qlycf.^r roteins, re:ept--r:s, lipid oil:.yer., l^xc:: :cm: -ar^o^e 
p^d'/ners, rretals, f :r..orqarii cjrrii. c-un :is , j -riaicruc-uL : cats , draas arid ±ru'^ 
c^mdidat'-s . 

'p-, ..^a'-'-*^ r:v-ile-:u:. ir ^:o:aT bo:-s , t'i'Or a:it:t sets are |f^n-ra^ed: one tcr eacti 
r,f rde^nwo tv/st :iL:b;:".oiles .^nd one b.^r i .:n jI tneir conplex--.;. The ::bE 

(" :"iisr 5n:i sec-'-n-i tes\ npoanis 11 m:! 

t;-!^ r-./:;. ■■^ompivxas are : ;i i. .a.-.-:: -.a i, aw . ..v, .......... ^ ■ , - . 

::ef me^ as r: _-rb . . „ , 

If is neoutive, t.-.on tne first ::.--sr :-:.:iv:rjuri .i iias sh- hi.'fber ai:oi_nity, 



pvo/^ldu> .^' may be reneatei with tn.ri :Tsde sales and ^r-iplexes. In i 
P:-rt:icul.ur case, r- ha :aethod :s used for diiferent s t> a:eoiso-ers ot the 
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.1 ::■ 'inf-S'an i nas h-^^b-r ."iilLn: 



sarae conpoanJ and m that c.ise .j.nly tn- difference b_ needs t 
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cal'-:ulared . In th^s pr^jcess, inrern..i:. araiaic jvojrdm ^tes ina 

..,f .,^^,1-,, |: rhe ■•>ranlex, lerined by 3 

1 r a n s 1 a 1 1 -on ^ 1 ani 3 i or. at i .>r.ai ■j-^-jiee^ 'J- ._i ^' Sm . 

^!:o:HlajL':':Y b'):b3 - ■dtMFdTld-:: Ad:: ■:d;]Tr-:b - Too appa-i-as ^apprises cevices 
abl.- t.-, a.-ner3te tne diita sets; :;er - r:^' i. ne ^:H'b and (0-;lca:.ate d.ffsrences 
v...;ween e FEs . Any worbstation si supe : co-pat a r shonii oe sair^ble, running 

^an "3aro sC'itware sii.di cio ^-..-^ -- ^^-^j-^ - ^ - . 4^ ^ . . - 

P-^h-ma^i-^: The rrevnod uses a new Xs .nte^rat.cn -e:nniiie, designateo 
r::ie m: eai a i ■: n (XldTA; , tnac .:rer:ites n all -ne -iegraes <d :ree:irm, 
^p-h -r.e harrT;cnic :3opr'.:x m-^t ic^n ijsei 1 i::r.:.viae ni ::!i:y accurate nuxerical 
mteqratior: ■:d the h iqn-dimens n c od igur n integral. 
A samplin-:! function, very similar t: the partitiin 
function near the low energy minimal on the potential energy 
surfa.-^-, IS derived by applying tne harmonic app r ox baa t ion to each 
low-energy well by 3 local, seoond-oroer Tayior expansion ot the potential 
-n^rgy function. The saiiipliriy functions are n : rmc]. - ze a 

gaussiins defined in terms of tne bottom of the particular energy well and 
its associated Ic-r-al Hi. 
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F.W: AT EE CH r^' EE DK EA EG EI EF GF G\\ C: 1 GR IE IT KE EG Ln MC MW 
OA ?T GiJ GE G2 UG EW 
W: AL AM AT AE AE BA BF BC^ BY ::;A ' Ti C:: CO C2 f E ['E EF. EG E. r,h (,E 
GH 'd/ ^^F hi: IE IE IE GF EE EG Er EE EE EG EE EE EG LT EE LV ME MG 
ME ME IG-J :C-: Eo d2 M. FT F'; Go GE E^ GI GE GE TJ TM TE TT EA EG 

[]Z VP YE EVJ 

AG ■V311143 A iyv>05:7 [ l9'^:^:'-':n C07EO"7-OP 0-- 

pn T0-C'-3A ^1 2' Es09EG Add;:'.!!-;) EE :Ctd-'O:"i7-0 < 
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FCT AH a<^lli4:- A Bised :'n W;.) ;':^216 7"d Er 1032 28 d Al Ba.-. ed cn VP- 9oE18 78 
PFAI EG 19-;E/-6:d;14 1.-^1114; UG 1^97-^2783 1^)9710^4 

IC ICM C07K007-08 

ICS G0i^EJ24-^)8; G'ilE033-E8; G06F017-30; G06F019-00 
AB 9 921878 A UPAB: 199907 14 



tr.e sa::!e ac:tivity a? (II) i^re :.e\i , 

CrlTAILED DE:I:R1 PT] ON - (1) are ol foin.ula X-C-^ - -ys -X- X i -v-- X-Cy s -X 3- 
X-A4-XV-X-X-X-^:-ys (I'i 

X, XI, y... , bri-i '/A samt' -jr "Afte.ient natural, m jdi t i^^d or 

n-Mi-na t" ura i .ir-iinc* a:-i-ls; 
>:';■ ■: r Asn 

] IIDEPENXENT OXAIMo are ais.i) i :.cl :d-E- a Icr the ijlLovjinq: 
1 ) a peptide anai'jq, peptide mirnic' ctr mim'^^^ir- 
fA) 'i-^t sr-^-'-:i 1^ L : T'or ore or rio> e ;^ ifc.^ yf^es o i: r^ ^r^-ral nic::t:inic 

aiToty] ■iTii ine recep:: r fnA-:hP.} ; 

c-:nj.arLnq r ar.J -zlt ra:e5 f':a Hi .in:; a r-a: c :n":]: „.n to th^ 
'j;^ : f '-I en^. auh-jnita ■: f n/vChr / 

i'i -er : i -e, peptide sniJc:, peptide ni L:[L': t i : 
■ :a: min-'r ^: i ■: (r; -.d (ID wnioh a lec^ i vei'/ :r.: dala as i :: I c^a : cal arfivit'/ nf 
nA';hF-; 

(4) ine^^hod f'-.^r -'Jete rmini r-g thv t liree-'Jiinen.-: lor {:-[;) 3':ru::ture of 
an alp^ha -'.:or\orr.xir] ana^'Hg (Ila., : r part :f it; 

(:■) rnethc-j i'::r soler.ting anc ■-es iqnina cinLVourris aii.ie to bin.i to 
nA'.;hK by analyzm-.j the :.-D struoturo o^f (11), oi part ot it; and 

(>■.■;■ m achi ne - r eadat: 1 e Oata atoj "K^e irediuiri vvji^ct cin (display a 
qrapni::i:: J-L^ r -^p: : eaar.t 'i t ion of an :-X[ r peptide cr c: n: alpha 
-c:-jn ot ::-:-:in (-.a: paro;-3 ■■f therr) . 

MXCHAIJI^-M 'IF ACTIGi; - Spo:afro .ntera:t_::n wioh neuronal niootanio 
aO'}-ty] --hC' I r-c^})tr)L- s . 

OoE - and i[;iinet Lo;-3 , ,ire usofai a:f oar :il jvas-'miar agents; ror 

tieating or -.ir a<.K"-'--;-^^ng anall-oell ^anq crar :r ri ::ir:a ; and as i.iastric notrlity, 
nrmary i noor. r ::.ner.oe nni ant i - -Brnc k.i ng acjen^s. 

AL^VAHTAoE - oio ^ind thei.r roi -a ■ ^- ■;. c o can h-i' oc^Gi^nod tr^ be seloct^'/e 
v.cjr. partioular sul::Oypes of nACliK, farticulariy the alpna 3 beta 2 and 
a 1 f-li c; .;■ rota 4 .r ur.' o y pe s . 
Dwg. 0/14 
FS CPI EPI 
FA AB; DCn 

MC CFI: B04-O01E.; E.04~IjO:; B04-N04A; rll-COyE; B12-K04E; 

B14-F(:)l; R14-H01; B14-M01B; Bl^-UO^; D05-H09; [iV>-h::.:B2 
EPI: S03-Ea7; Sn3-E14.M; TOl-J; TOi-.TOSB 

TECH f.iPT>:: :9'^90714 

TECHlIOLdGV FOCUo - PHAPMACEQTICALS - Preferrea comp:: ands : (I) is 
C-bimidated ana has XI = Asn; :-X: = Fro or hy^ir'^xy-Pr-:' ; :':3 and X5 - His; X4 
= "^lu. They have practioally tne sane oiologi.jal activity as : II ;■ . {A) are 
sp^ecrfic for one of the receptor subtypes alpha3tiet.:il ; alph5i3beta4 or 
aipha2bet al' . 

TECHnOLC'GY FOCUS - OF'GANIC CHEMISTPY - Preferred process: To determine the 
3-D structure of Cla), MMR data are obtained for^ (Tla) and a.naly::ed, vd.th 
reference to Known structural cc-oi linHtes f'jr Mil, by mol.eoular 
replacement. Seiecled (Ha) may tnen t-e tested tor ;:^:ndina ■ c> an nAChP; 
innibitron of hmding of ligan-i t nAChP artd/^-.r i nh i h.i r. i.ori of bicdoricaJ 
function mediated t^y a natural li^and of nAChP. To .-—.ect a 
compound atifo to bind t-j nAChP, th-r 3-D stru::tuie o!^ Mil is analy-.ed to 
identity regi-jns that interact v;itt. p^articilir functional 

gr^nups, then .ilatabases examine'.! for c^mp^:>un:is hiavin ? structures that best 
fit these interacting r-qions T't iesi;:n -a c-jmpound iL>le to bir.d tC' nACtiR, 
a 3-D representation Laced on --o] :ii nates d-:fir.in:i *:he structure {of 
p^art) r-f [11] is disf.iayel and i nt e ra ::t icTi wi*-.h c-arts of a ^.nown ligand 
are -na ra-t e r i z-^rd to identify suitable giC'Ups f c r replacement. Croups that 
car: repda.-e one or mc^re parts of the llgan<:i, -while retaining ot least sc^me 
tindmg affiriity, are then identifiea fr-:rTi a on<:wLe;iue i:>ase of oc'ssltde 
sul-stit at iC'n ■:ir:.ufs. Alternatively, p^i^sitic^ns tnat .ire ti^referred with 
respect to f a vor atdlit y -tf interactions with functional groups 
are identified :.n the 3-o stru^rture, Eind parts rf a kn<jwn ligand close to 
these p:-siti'cns are chara ::terised . Frtm a .oncwledge base, frac:mer:ts c r 
molecules that r^ermit connect! :n :-f the preferred p^siti^ns tt porticns of 
the ligand are ilentifiei and the structure of the ligand is modified by 
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c^iival'jnt attachment 'if tht.'se fra<:imentr r;r iaol*:";ju 1 es , in the i".'r':;per 
iDt i.'^ntat i-'^n and loca*iori. !'>-'rripL'ar.cis cj* ■ s i qritrd thu^. v-vriv ar(j tht-n tested lor 
it'jdulat ■■■ry activity .jt nACtu-. . 

F r epa ra t iv'n : (I) aie pi'*'duL>-d ijy u^iial nietn''d.:; of peptide 
synthes is . 

Tf-V:i-iN' .Li 'I ;y Fij^MC - 3 i OTiiCiiN-.'L'JfjV - ?r-parat.on: il) ::;^.y F.e prC'duoea by 
uvu.iil r-' :-:'mb i nant L'NA rii- ■ t ri-j' i;? (r.<:'t ex-nir:)! i f ^ ed ) . 

TFi::HNt'Li,Hyr fri-^iic - (>.)MF'.it::']C AI';[' C'jNT^-.L'L - rhe data storage rfi^^-dium 
disf:'^ay;v a --0 rerr ^-oen^ a^. :.o r. ija.:e':: or. all -^onie of " ri*=- ^itorrd.c 
c .■■j^^i^ n.: t.'.-s t a ken i i -m -x t-.iMe, r ec:rc < laced i n t he spec i c at iC'n, «:-f the 
3 A'^i .-t'l'i.i- (.'f fix / oep'"'3.i.^ ^".e ■• r 'oo -.n : VirO Protein oata 

:<jn": -i;- 1:'l2i-, (>r 'i-r. /C''"! rdm ^ ^ e.r v;itn a I'o :o:-n'';?.::r.-squ art- deviation, fcr 
!..o ::.i-e I '.'-d io^ o -d 'jn- ■ protein ■:':.'\-v^^ , ■: f olO tv-. r 1 . oArigst r c^rr. . The 
rr.-L'ii -irt it.^y carry * wo st-* .3 'lia^ a : o-r.- <h 3o.-.rl':r trunohMin of at i east 
p^jrt :r tn-.:- c.crdir.ates s^jo o: f .i above and th-- c-ttier t r.t^- toordinates of a 
!!■ .. 1.^;. omI e or in- ol -■"^o ! a r O'. iuf > L-o-: . 

t'( )FVrdGhT 23C: ['nFWH:hT I :]F':!RI'1AT Tt'N I;TF 
Ol o :^ - 74 

o:. it^t;^ t o .!■ ;r i OK. ohann^'-l t.Sf:-fol J ti (::ene therapy. 

M; d::3/:iA, l' 

\ . ; i ^ ^ :e3^ I ^ FN CloN ; I : - 1 1 

i i/-. r .h ■ or 3 3 - 3Vj 

Iv : /'.h 33-: 333 A'' F/. i-^F Fl- ::r. F i' CA Fn 3'F FF F?' EF Fo i-'I Gh (3D 

t-F G:: CI' HF F Ih 13 CP l-V; v:? ;<'7, ni ' op; Or. 'L'o FT 13; ;.V MF 

Mt^ MiF i:n mw h:-: :f3 uf ;"f ft fa- fv ff ff fg f: ff: ft tf tm t?. tt ua 
ft; tFV vi: i':' Fv; 

A'.) 9'^lllFF A l';^?:'tOF10 (lF;o:i-::) (3lFN::dA-]I 
FF 1';FFo.;F A1 Fi/OOFxF: (^'o:;ti41) Fi: (71 .d']::i lA - 1 1 

R: AT BF ^Al CV C'F :■?: FF Fl F?. F;F <.;F. IF A: LI LF MC NL FT FF 
ALT ■;ohj07F-1 A1 W3' n:-: -ddcFF I 1 ''^ 1 OF 1 ; AF '.i^IlI^F A All 13F9-lil22 

l:^FtA'A.l; EF I029;>;2 A: FI' 1 3 33 - FF A:' F i;oA3i,.21, Wt- 13---UF22321 1 9 ^61021 
Fl>T AU 9-^:11.. '2 :\ &.-ise'-: o-n Wt) d9..iF7F.l; EF I02901F 3a1 Faoed -^n WF) 9920734 
PFAI ng 1 ^ ->;:: ~ ^ 0 2 a;^9- 023 3 ; "3 1 97 - 3 3 1 A:: ■:9A71C:^2 

IC l^'^A GloNOla-l: 

l-'^S C07il02 : -:a1 ; C07K014-00; C07K016-00; '1 1 3 : 3:- 1 5- 3 : ; 

C 1 3 ! J Ld 3 - 3 3 ; ' " I 3 '2 iJ 0 1 - o 0 ; CI., F' 0 3 1 - 1 ■ 3 ; i; 0 A I 'J 1 3 - 3 : ; C i)l M 0 3 3-367; 
G06F019-00 
AB W<;' ■.^9^(17 3 4 A oPAL : A.^f907 M 

Kt'VELTV - A V'::Ara';ie-';ra' r-H ^en.ri'it'e ptA n-- oi uin iC'i ^nnel FloF, t:0-:pressed 
ir. s}:'erni..ito-::yte^^, is riev;. 

l.iFTAILFi: FE3:3F'.I PT A3M - Flo^ ha.:', as a rt 'n^jiii-'r , ::a J ■:;u.L at ed ni-j 1 (:'Oular 
v/-i.:jht AA"i-13F kHi; nas nr.it ':--jr:':iuct iin'-'o : as functional 

t'-traTier, when exr. ress*:r':i \ri IF^njC'-j;:. ■jj-::ytes; ^tf -li-IFO r)F; has intreased 
a -tivity ar i n-^ ra^ 1 1 u 1 a r ph ak-tve Mt- 'Ot T.l .:;r.d binds sr^^^ - 1 f 1 oa i I y to 
p'AyrltroiL .intirjodieo avriii ntA s-;-quen:o-s ;FI', t F^ ) , ( ? :d or {['4), 1 1 
given : ri th-"- spe - 1 M a t t jn , wit ri 1111 and atjtit 1 Llf AA-C and 1023 ammo 
a -ids, r-'iSf.eot : ve Ly . 

: :][)F;PENFE:iT t^IAAMC are xls-j Ln:Jo:ieti f-:.-r tne foLlawin;!: 

;i) ^S'jlated nuclei atod ■!) en :oding Flol; 

■2} IS..' la ted narleio: a .0: is -la) -r. ::)iLr. i at Least 15 sent Lgaous amino 
a'Ads frs):ri Mj:, --.wl t^.eir- ■ oinse rva t i s.'e 1 y moAifi*ad trari..ants; 

;v an" io-:.di-:s o^da ■ ^.na" rAnd se:e:tively to murine or numan FLo3; 
;4; •■O'ipressi'jn vector- c ant a: nin :j 
;f n.js^- --^el ! transf-^cted t;ith tnls vectar; 

t -A- mett.'ju f^'t i :ient. :.f yinu aqents (A) t.nat increase or dearease 
a.'jn-fiux t-nra-ugh a pH-sensitive potassium, ah.innel; 

( ) detecting Slo3 in mammalian tissue oy reaction with a seleative 
binding agent; 



Zhou - -'9 / 4 9 3401 



(3) cnmf;.uter irethod of ocreoning for nutations in S.Io3 aenes; and 
(3) c.;.nf.uter n,etho:i foi i-:ie:r: i f yin ^ a t hree-dirr-ns lonao Olr;> 
structure. 

A<"^TIVI^V - '-^cntraoeptivo; f ort i I it y- r.;M5uiat- ng . 

M^-HA-nm OF ?:2TZ0n - I'lc^ is ifo/olo^ei m s};.erm capacitatiDii and/or 
the a-r^some reaction, essenrial stef.s in t;er e i ... i zat i on 



'p-r' - ' sio^ and (I), enco.-ling iv, :ire used ro id-^ntify sp 
" ' y' . . , , . ^ ; 1 ' . , ,^ f.,] f,,v- t- q -i nrj infer+"dlitv and 
nh^ it-"^^- - and a-jtivar<ars yt-'O . -in . - y ^o-:^.^.^. j _ . 

as c^.ntr..coroives), also for srudymo soern pnyoiology m vitro 
Slc.^-spooLtic antibodies are asel for cirjaost:-' ^ieceoMon of olo. 

-1-.^ ^, 1 .'.^ - ^-o^iTw-rs voL-ri ano* n--r co<inro:d.. 

protein: "'^^-.'''Ded.e. a^a r.p-: rter^ f ■. r r.easurir.: .n.nues m 
pry assiurri ojn aen tra ^ ilOi , oaire:iT: iie^a, ^-.oi i.^u.., y _ _ 

Ki a irrrnt- 'f a-o a:-efui as nr-res for i :ieri 1 1 r ymy homols.gs, 

v=r^ar.ts and nutant.v asscci .t-d witr. air-ase; > lete... .......c^. 

or protein; for chi i.mon .ra. :2 Uza^: i :n ; m y^.ne ther.py; ty 
:.ientily.r..i poteni.:i.l modulators; t: measure up-regula: i.r. .1.1-.. ir. drug 
screer.in; iss^yr ai:a for production on recoribonant .>-lo:, protein. 
Dwq. 0/4 

FS cp: efi 

D05-H09; D.v;-Hll; DO'-Hi:,^.; r-G^^-i; L2F,; L(.)b-H14; oU?-Hl 
EP: : .■;i'-E14i:4; TOl-J 
TECH !.:??:■:: 1 9':->:''07 1 4 

J-.,,,;,,, _ pi,vr,.:-;Y - Proferiod nucleic, -JCia: (I) enoo^€.:S murine cr 

hnn.^- --l-^ .-.rei-:itio-.ai-V \t.j, i;.ul .l. , --r , t , - - 

:C"y.;nrid^-.-^' under T,oderat..lv r inge.-.t crnditions v;ith sequence^ u^i), 
y:yyv;Vy-r :.:4), ct a^ou-: 0.-^, and ;.l , resP-^ : r ive Ly , qivon 

E-M^.l'i: 'r^ri^di^-nd; suitable lor .npi i 1 icat ion of tnes. fo.r sequences 

:ono:;rof^pH-Lns^t?sr;:td;;:ui^ 

more than GO", ammo acid sequence identity with ^he , o-64 1 re • on of the 
Slo? core domain and is recognized by anticodies rai.-.-d .^gains. the core 

domain of (PI), (Pi)/ ^f-'d 
CTCGAAcT'lCCTAAAATilTTACAGAT (tI5) 
TTCC'^TTGAGCCAGGGGTC.ACCAGAATT ( 116 ) 
TCTGCTTTGTGAAGCTAAATCT ("w i 
TTTCAAAGCoTCTTTAGCGGTAA (Hi / 
TTATGCCTGGATCTGCACTCTACATG (119 ) 
ATAGTTTCGGTCTAGTAcCGAAA (1 11.0 j 
GGCAGCGGTCATTCTTTCGTCCTT ( III 1 i 

TC-C^^^AAAACCTCAACCC/^-AA-ATA CHI) , . ^ ^ 

P--ferr^d r.r.-.-=- = s: To identify (A), a test compouna is applied to a 
pydrvntyOceiis (or its membrane) that has e.v.presseo a olvl monomer, and 
:";'^e?:-:-' o^- tne compound determin^:.., particularly ty moasuxing the "hole 
";{l conductance to detect increase/aecrease mien flux. Particularly the 
monomer is a recombinant protein. To detect SloJ, the speci c 
binding a.^ent is an antibovry, oligor.uclectide or nucieic c.. id probe. xO 
..-r-enO-or mutations in Slo 3 genes, .sequences (particularly w..eie 
associated with oiseasei aie compared, over at least ..0 nucK.otides (nt), 
v.-th s.---:u-nces (HI), (Hid, (113) r (114). . , ■ „^ 

;ypy,:0:n: Antibodies against Slo^ are raised by usua. irai.unization and 

CO' 11 Puoi'jn K.ethsdo. 

T^-H1]'--L'-^GY Fr.ynS - Ct.vPl^TinG ANP C^oNTROL - Tc ietermme the 
ti.r^-^-dinensional strucrur^ uf Slo^ protein, Jaoa covering ^ a- 
l.^ast IC amina a:ias of the protein, r 3C nt if tne nucleic 
a-id ar^ aral'-zed to generate piiiaary protein soruc.ure, n.is 
mv ;h.=n'b" c.-;.vei ted, using energy terms associate-u with tne structure, 
t ■■■^ s^-ondar V tertiary structures, ana opoisnaiiy duis.jtr.py .a^ir.s 
the t-rtiary s^ru-ture used to devise a quaternary s t ru :ture . The 
resulting structures may te used t. identify ligan a-t mdmg sites and, 
frr.m these, the li ^ands themselves. 



TECHNViL jV FC'OUG - EIOTECHU^jLC'GY - Preparation: Expre^^sed sequence tag 

(EST) d4tc;iba::es v;ere searched with tdie C- 1. e r:::i n^a 1 reqion cf n.urine T'lol tc^ 

iJentify t.y.e norri'jicq /J\-"i725:b {GenEank) , -ieri.vea froiri a rrousf^ 

prc^rnye Lv. -y*- L^:: VJEHI-? . I ] i.ne cDUh lirrary. A 1254 bp impli'-on, generated 

! r-ani th^- ^d icinid ::C'r.t .:i i n i nq tr. is ES'V , v:as clmk.! t j srreen a rucuse testis 

cDNA lii.rary ^ <j i.srdare me t ll-lenqth. sequence iU]), froin two 

^ ver I iip'p ir.u f ra- jner.^ s , it;':^ :riur inf^ .:->aqu*vn .:e was us^rd to pr'T-vide pod. yrner use 

-anai.n r^actiC'M pr-iiuers l-jr ^mt 1 i ti':oat i n j\' r.uman testis :'L!MA, resuitirig 

111 se-iiu^rno- ill..}. Thi;:' v/.i: ^js^-i ta proi: e the s arn-a lihriry ander 

ri''de r :) t ly :Ot lm n':^er.': ■::c:.^i.tic:.3, t^' is:lat»;' r- c::l a nes, encacdnq tne two 

\-.a.r i.arjt s i -Ord .=;nd ■ ) . 'jrpae isoJ^Ted, tne '::!aNA may be as-:Tjre ssed in 



V.ddX ■:v)Kypq,q{T :JERWE^]T i:: EdRMAT I Ci: LTD 



AN ! ;^'^^d-idy^'-^-' M - . 

DNN r: 1 ^a-^- 1 1.-^.]^ t id ;^'-o-a- 

Tl K'jleL^o.i .:r a-:rs :.qn iay •.:-:<::. .:noffI:nq witnin <;i iibr^iry at JNA sequences. 

DC Pfi4 Dlt; TOl T'l^: 

IN h:ARUBE, 1; t:)KARE, V; dMMI?:i'RA, F! 

PA iKARtr-I; i'ARTBE I 

CYC s i 

PI W*:' '-?n--:18 AI i;^d'.tO:ll ibR?:da)* JA 4sp 'i 1 P t) j Q I - 6 

}iw: /i: bE !.:d t,:i ia-: pa pc id' PR gp g.:: t;i4 gr jp it KE P3 lu mc mw nl 

' A PT Gil GE GP tv; 

■.dl t;M HR Hd lii bP b: Jr r:E i'G KF KZ PC PR i.R LP sT bd bV MP MG MK 

Gb bW rp: Nt' KP iP F'T ?G- RL^ GP GP GG Gi GR Gi- TG TM TR TT dA UG US 
Pi' Vd VN ZW 

At^ Rbb:;-t.P A b>R^M:s^ !n''^^'-P]) 1 i pi J 1 - 6 ^ 

APT Vai -'PI!:b: Al \\0 i • -id ' : ^ 4 1 -pt: A ^ J ; AG :a: i S : { 7 P. PiJ b:) P - o 3 A/ i3a8LiP2o 

RPT A'/ -A—'--'" A P.-;a-;;'a an. bv' ;^-;'Mib: 

PRAI .'P b;'R^-G4pP7a 1 ^ ^ -AAA:: 

pp: : i'i^i iPi -t.p; 1 - P^: 

b;^G iP;7}|pp 1- JL'; C07K001-00; C07K007-00; 
C07K014-00; ClPb'dlP-ll; G06F015-42 
A3 V;(:j R a : b: : 3 a U PAb : bAj G ij i. 1 P 

b'P'AEPTV - Pi jI H'lrul ar aesiqn is ramed '"-ut by snuff. bing exa-ns within a 
library of sinilar C'NA sequences, foii'iwed by ranking of the pro<iucts of 
reshufrlmq by their fit t^-^ the desire-i functionality, an<i 
rep'eat.mq the shuffling and ransAng until a p'rc-iiu::: c>f the desired 
functionality is ah-tainea. A sinilar pro::ess may re carr.^ed out 
using polypeptide :^ e queri :es . 

jETPALEZ) L'EGCF.I r-'TItjv - New functionality is a-titained c r 
^.-xistinai functionality is inL}ora\'ed in polypeptides or 
nucieLC acicis P^y: 

(1; synt h^i-si sinq a gr<-: u}:- a f polypeptides or nucieic acids 
v; i t fi i ^ f r r i r. q s e qu e ri c e s ; 

U'q- rr.easurmq the ti^ ad" thes*:.- 3e':iuences t^a the desired 
functionality by iat-arat -try exp^er inent ; 

("A lankmq the sequen::es m <:'rde: of this decree a)f fit; 

(4) fu~er'aring a bshuffii.r:; lih'rsry' C' f seguer-o^es by shufflings parts 
f triese sequ-inces h-jtw^er; individuais r=inKing rA^i.Iy; 

{''■■] s ynth^'3 is i:aq the resuiting sn^-ffAed sequ-icces ; 

( ^'') r*..'pe..-it ing tl\e :jb<:.o;e iteris a s-.. f fA :ien t. nunuoer of tiiTies tai obtain 
ne 'jr n-rjre se-rueniaes w^th cl'ise f it t-: t he desired functionality 

G:."E - The teahrAqu- c^n c^e used f:r tne inipr a\'ement c>f 
function, and devisinq jf r:ev; functions, af 

polypeptides and nu^aleic a::ids, fa>r a P'road ranqe of uses such as 

crugs .:nd f jodstaffs. 

APGAN'TAGE - Evolution af sign functional fit is 

ratdd compared to the use a>f c^anvent ional genet it evolution 

:z 1 gc'i' 1 thms . 

jwg. j/4 
EG CPI EPI 
PA AB; DSr] 



CPI: Bii4-:01; BO^-Etl; B04-N0400E; Pll-:01; B12-K04E; 

EPI: T-)>a; T0b-[t02; TO^^-Plf- 
T^PH npTX: 19. + 9' '5 10 

TEPHIJ'"'^ '>'"-;V F'^'^US - ^Ic>Tyri{l\'.^LO^\ - ':'he ohuffl::ig .^.1 parts ot the seqwenjos 
ma- i-'^-cppiein^.nte.i ..y .st.p-- i whi.:h ■: or i:- ;.r. ba.os or ..min^ acids of 
thi .:-;quence are r:r.: r: . ... e-.l ;bv .i::tr.-^r. i.letr.M or . ubs t r t t . on ) . 

L123 AN;.;VJFR 49 t'F 7? V;?]:-! -MPVpp^u-: L EFWFi^T jNF0RMATI9N LTD 

i9--i9-i5^. [!•! ] :x 
r r.T- V : n7 [■;:■: i'p)-v4' 11:^ 

;;:,/:^tn':t.o--: o=.c::us r: n.uro: ri .ae r e ,- -loi:- e i-e:uiar.r i.F: polypeptxde 
ati'.i t: o 1. yi:.i-: Leo tide - ..:0-:!;i- 

t r-^-ii^moTi* f Sr.rGp t<:oo:: c ois : e ;: .0 : c 
sri:a.:itv^ ano men i t.a . t ^ . 

?K Bi'JwAh' S;'gF, Y; H.oL:4FS, b j; inGR;.HA[4, F a; THFOUF, J; KALL IF, N 
FAFA.-AI!:, H; HOLMFF, F; INGI^HAi:, F; WALL'S, 11 

PA (:r>:-:iK} fmithfline leecham foff 

U OmOP,]. A2 ]^^'V.!)ro fi9'^^^'M) - EI] r.Fp :VILn015-31 

r : AL AT BF Cti t'F :; F 
i-O' JE SJ 

- ■ .P^o-.ii Al 19 -.-J-Oo ^ l-''^- ■EI 

" " r,^. - A iLp -O Mv) i 4- 

;?>:.5t Ai ■:A 19^^S-2L'i3696 



rra:;ero;-; iSiod for prt; v^nt ion anc 
h c:;as-;; CM^n ■] un ;:t rvit i s , 



r: p:: (Vr -P. IF IT L2 LT LF LV IT:" FIK NL tT 



' 12M I- 1 5 ■ 



IP' ^ M '^ ^-i A 1 i L^ - - . . . ^ . . . 

ALT Er aLFF^^: A2 E? 1 m ■ ix - :7 C S^; i '.^ ^'.'^ ; '-m 
i -1 " -*■ o ; I. i ^ X J - ■ - - • - - ^ . . . 

PPAI iy:,7-6::;7:.; l 

TC I--:-; C07K014-315; - : I _ _ ..-i,-,,-,^ i.e. 

■ 9 :90'^"::'' "."F:,:\-: , . 1 ;Vp-" ;: . (. . 3.;-- , ; C07K014-705 

V"c07K016-12; ol 2F91 ':-9'- 9^ioFo91-99; - V'' ^ ^ 7 9, , -'r 
r^olI'O : 'Vd I]i.F5 4 h ; '■;F 1 UO ? Fm ; dO IFF .5 :: - 5 :■ ; ^.0 II F ) 3 J- :) o b ; 

G0lii033-F69; 'dO i t;0 33- 97 7 ; G06F019-00; Gl i BO jS-OO 
ICI C12Fm:)1F-09, dlLP.OO 1 : 4 F 

.--r a:.- t- TTrr\Z}. ^ OULn""; ,11 

noVFLTY - An isolated polypeptide comprising an ammo ao... at 

?e;st 70- Identical to (1), a faUy defined 423 Streptococcus pneumcniae 

Pesp^'i^nse P.equlatcr (PR) protein, wfiicT. is pait or tn..: two 

component signal transduction system (TCST3) . ^ . ^ f - >^ hb.. 

r,crx;.TT :,r, DE'--.''^^' ■ F T It)] I - I rJFE PEF'DEFT CLAIPF3 are included for the 
f..IP:,wrn.;ra) an isolated p. lynuc leot ice [ID c^.mpr.smg a r.ucLeotide 



e,.n,.-n-eOa^ l^ast 70; id^nti-al no a polynuc.eot lae encoding ,1 , (-) an 
00F;A9-.n systeiT-, ronprisma Dolynuoieot ide CI); ( V) a hcst cell 
-•'rprFsino ^A expression syst.-x or a moxbrar.e of; 14) an antiDody 



,-sten, cM'iF'rising polynucleotide CI); 
comprismo ^.ne expression systfx or a moxbrane of. ^ . 

;.,„,l.r,o.-r-,^.~ific for (I); 1 ^' ) in .igonL?-. . r an-.ajoni^^t (III; tc (I), ^b) us. 
•-, r • ( 1 I an aaoni t '-^ L '-n '.a'-H'- 1 . ..■ - , . ... ^ . f ^ - / .- - ^ 

Ft ., j^M^-.n^ ^hat -..■■.■1ui--r.-.,- -xDie^sion ol (II); m t.'-ierapy; (7) 

A'paiI".'-.rof (lALF an isdat^d p olynuclo :.t ide comprismn a nucleotide 



.s.quenceA leas^ 9y.F i,,Aa 9a iF F J , a lu: ly d.i.^ed 3453 Ip nuc.e.c 

.-.iv.-T- <n th-- .-.!.^;if icatior.; a;:d [9) a polypeptide which 
9; 3F :"a;. amine 9rxd s.:,u.n..v a-, l-^-a.t 70 i.icnn.cal to CV), a tuxly 



-j.- flr^o 428 arrdn.. a-id protein giv.-n ;n the /pe. • Liication 

^^i--;.- _ FR se-:iu..nces ar- •JS.''r:u:. ici oa aqn. r.mg [iisease <.i 
<.-,:■ -..p. -v-ai -,v t' ■> :isea.-.e :ei.-ei r.-. ;- r- polypeptide (I) xevel:.^ 

1 « r If m-nq the presen re/ •-.b-onc^ ■: t a nututi-.n m the jen-, or 
F..V--s''V9A th^ r-resence or ...cunt of C) -xpres.ed in a supple due to 
^^-F?9:^Jn"with an orqanisn with r.. .^ne :..l.ined). They :ar. diagnose 
■:d-aF and typ-. -f infection. 7ney ar- al:?.' ■Jsel:ui for screening .or 



c L-'i u[. ■! 1 1 7 / .-ii^ ... -^^'^ ^ ^ - ■ - * - _ , 

function by determinir.g the effect n a:tivi-y of ;I) :claimed). 
runction Ly ..^^,...n^^^ -.f conditions associated 

vA';n"Fp' imbalance, and are tn.r.f::re pctentiai antibacterial compounds 
rv.i-.r'U-i^-'tides are useful tAu" .lenetic irnmun i sa t ion and antisense 
sequences are useful for innibi-i:n af expression at (ID- kR 
polypeptides are administered to treat mdiviiuaxs m nee^^ a. KK 



in Fib- 



polypeptide ■; I ) 



R 



4 H 3 4 0 1 



.■:je 



IT'C 



polypeptides {directly <: via i sector i.e. gene therapy', and as 
an antigen for indueing :n irninane re.:p:^n-e. They can pr-vent adnesion or 
bacteria tt' matrix proteins, and are u;;eiu: far use on w.junas 
and body irr.f lants tc pre':ent bacterial infect ion . Ant i- HP antibodies 
inducevi by r ne polypeptide are usefui tb-r [preventing or : re:jt. irig 
infecti_OMS, especiaUy l:...:cteri rL in f e i 'jn ; , .i also isolating clones 

exoressrng r P. Hi, m purifymj one polypeptide Ly attmity 

crirc'ma tt gr-nL'ny . l' l^-'^ji o^.-;-- pitv-m.^-u, ^ -^.^ _ . .^-.^...-i ^.i 

ca..sed by bister nal ^nlection, esoeciaily 3 1 r opt :c: :)ccas pn-amoniae 
inf-.-;^ ions, v;hicn -aase -titis iiitclia, sonj an :t:i •/ i t is , pneumonia, 
ban: or--oia, ^inasLti.s, p.euia] --T:pye:iia , • ni 1: :-a i' i i t i s and e;-;oe::ia:iy 
meri i n :| 1 1 is . 

r; p- T ^ p 
FA AE 

MC CrI: B')4 -SI.).. F; Bo4-F'.ib; i^04-FiJ 1 JJF; E'i4-'';a7; B-j 4 -■■h..) / U C F ; B:)4-...F..; 
Bi)4-GB100E; B04-GFF; B04-G0100E; B04-NC -A'jF; B12-K04A4; 
B12-K04E; B12-K04F; Bi4-A01.; Bln-A'JlBB; B 1 4 - FO 1 ; 

B14-F0J; B14-K01; B:4-L0I; B14-bOi); Bi4-nOB; 314-lJ03; B:4-N04; 
FI4-Nln; B14-tll7B; Bi4-S03; 314-3113; [^:.:;-dl2; CG[;-H04; J05-H07; 
D05-H09; B05-E[I1A1; F' 5-HllAF; Fj5-H1LB; r.Ot-HlFA; D05-H12BB; 
FOb-H.4 ; i:'Oo-Hi"'An 
Eti: S0 3-Ei4H4 

L123 A1C3VJEP" 41 F'F "H- WBI!-: -."(JPYP iGHT .FjIM lFI-F'EFT IMFC^r MAo lol'^ FT J 
AN 1F99-]20j7:3 [lO] WFi:-: 

TI Inferrino the tjiological function :d a protein using a 
lig<ind datat^ase - fr.-.rr. ^ ne function c f ^n^awn proteins 
to which a iigand Finds wnich is a F:- o a5sal:di. rf Fin:lin3 to the 
protein untler exarr,: nat i c^n . 

PC BF4 TOl 

IN IMAMUP'jA, M; ITAI, A 
PA (ITAI-I) ITAI A 

CYC 8 ?. 

p. W( J :? V i ^ U 'i:^ l:n^'Ui±-i \ ± y y ^ j ^ ^-^ - - 

PW: AT BE CH CY f'E OK EA ES EI ER GB CH CM IE IF KE FS FU MC MW NF 

OA PT SD SE SZ UG ZW 
W- AF AM AT AH AF BA BE BC BF BY CA CH CN CF CZ FjF DK EE ES EI GB GE 
GH CM GW ^JP HI: IP :F FS CP EE KG KB KZ FC FK FP. F3 FT FU FV MD MG 
KK MI] MW MX NO NZ PF PT FO PU SC' CE ZZ CI SK CF TJ CM TR TT UA UG 
uS UZ VN YU ZW 

AU B&79:.66 A 1^9-^0125 (199923) CO^BOol-OO 
EP 1008^.72 Al 2C00061 4 (200022) EN CC':BF'-1-0G 
R: CH C'E ER GB IT FI 

ACT WO U'^r:^^0<) Al WO 1 <^*V^ - J F2 98 6 I9^^&0'/02; AU '^079266 A A'l 1998-7 :)36b 

lMqbO-'02- EP lOOHo^o Al EF 199&-92^30F 199^070., WO 1 ■*98- J?2^^8 6 19980702 

FFT AU 9879FhF A Based ■ .n WO 9901409; F? 100&F72 Al Base^i on WO ^^901409 

P?AI JP 1997-177933 19'^707n3 

IC ICM C07B(j01-00 

ICS C07K001-00; G06F015-40; G06F017-30; 
G06F017-50 

AB WF) '-iOOMO'-t A UPAB: 19'.^903Lf:) 

Interring tne biological function activity 'Ot a 

protein v;n:se three-dim-nsional cc n f i ^ui a t i--j-n :s knovvn or tan be 

interred comprises: (i) constructing a database of three-oii: .cM:sic nal 

structures of a lar^ie nnmber of phys ioi'->gica J i y active :omp<;unds which are 

known to bind tc- tiiiget proteins with <r.'jwn convlogical 

function -jx- activity; -lii extracting can::ijo^>- li^anis to :ne^ 

protein (I; ur.ier examination froit tne Lnatai.:ah- and tneir a--.-i.it^ 

to form a ■:tmplex with ; I) is investigatec w:o:n a suitaiide dtckrng 

pr'-gr-im; (:ii) a rari aicat e ligani is ch:'Sen 1 : ::l the iaiat^ase w'hich has 

the best dosking with i.e. forms the m:sr stable >^mple:< with (I); 

and (iv) it is tnen inrFirred that the function 'tr acti'/ity of 

!l) is similar tt that ■. f the target protein or proteins 

r^-corded in the database as binding to the :hjsen ligand. A suitable 



pir.'.qi^Ki Tnx cdrrym.^ jut this mve i jat i.?n is ADAM^^EVE (describe:! m 
W- ^-nl :1785, A':ik:^ It ::i ) which ierer:Mne? nr.e be-.r docking cer f ■jrma t ion 
thv ligan.:! w^th r he tf>rget protein and tn-ri". calcul^te;^ the 
en-igy of i:it er act i i-n betwr^eri rh-;:m in this -:-ni :.rir;.. ticn . 

fj3£ - The rrf.rt-.:.i ic upcd and a::urate-Lv i-fers the bicicgicai 
function c;ctLvity f proteins, u :h a- 
proteins s-f [^revioasiy un-cnsv/n function di sc.: ve rec:i 
during *:he riurnan genome pr j -'-c^: , f-:-r use (fcr ^-i-^air;: ie in 
me *: a 1 1 c in ve s t : g -3 1 i c n s c : dreg i s :. g n . 



PS tPI Eld. 
FA A!'. 

'-i-^^: h-a-AO._; B.!..:-A; Li:)4-A--4; B04-N04; B!'.;-L.-)l; BIJ-A;?; 

B10-Aj^; B::-^:'.HE; B12-K04 
EPl: h':l-t-.p; 7tl-.T:0 

L123 ANTWEF: ^2 73 WP::-: ^M)rYh:d;ir dCdc, [!EPViE:P: iPFPr.MATIOM LTD 

AM 19^-9-1 08 34 9 [10] WPIX 
C'lJC ri.'''99-^)3PP^, J 

Tl ht:.'W JecA polypeptides .^nd pol yriu :: i.-ct ides - i.:s^rPul as diaqnestic 

reag-nts an.i for preventic-n ar:d treatment ■: ± S t repr.o :• :.ccus pneumoniae 
inte-;:: lo-ns, esf:-:-c la 1 1 y men:n::i* . 

PC pa 4 mo 

IN JAW<jP3KI, l: D; ('PWYEP, K M; PHr^PIhid h P; TPA::II, C WAIP-d M; WILDIMG 

E j ; '.V PWPPj-;, X M 
PA (::i-UK) ix'lITHKLIKE BEECHAM C-S? F ; ■oMIK) SMirHPLIME BEE':HAI4 Pit"' 

CYC p 

PI PP 8 94oP/ a: it^^^:ipo3 : 1 9' * EN 3-P:- ci.p:p:;t-3i 

k: AL at be CH ''^Y ; 'E l.y. EC P: PR GB IE I? LI i : LP Lt' !.^r m^' pT 

CP s: ' ' ■ - — 

CA P^3::t'^P A Ipp-AA 0 1 .lato/'c. CI : dP .3- i 

CP illP2P3l A l-^'V9c72I ■ 1 ;^:- ; Pit Ci:CP;i!:-09 

AL'T EP 3;^4'^^7 A2 EP 19y3-3p537s i 9:;'::.i;7^ 3; CA 22 3^372 A CA 191^8-22 33 672 
19980724 ; JF' 111 92091 A dP 19'^b-2P02&9 199-^0^31 

PF:AI US 199^-999720 19:''30313; PC l'"^97-54563 1 3''^7080i 

IC ICM C12NnlP-09; C12N015-31 

ICP; Abli-'P31-7t); AtdK035-7t;; AtlKOYP-CO; AolPO^^-K^; AGlF'039-09; 
A61K039- 39P; AC lKi:)4 5-00 ; API KC'-l 8-00 ; C07K014-315; 
C07K016-12; C12IJ0("il-15; C12Ni:Cd-19; ClCUOVi-pi; ClCtJOOS-lO; 

C12P021-02; C12C001-02; C12v002-68; C01NO33-50; C01N033-Pb^; 
G06F019-00; Gl 1B023-00 

ICA C12P0^1-0P 

ICI ClCtJO 13-09, C12P001 : 44 

AB EP 894857 A UFAB: 19990310 

A secA polypeptide at ±east 70^, identical to se(.:uence (I), a 
fully defined 837 ammo ac: j protein given in the si::.eci.f i ca t :.c n 
is new. Also claimed aie: (1) an isolated f: o 1 ynuclec t i.de (11; (DNA or RNA) 
complementary or at least 703 icentity ^o a pol ynu rlec t ide encoding {!); 
(2) a vector comprising polynucleotide ai); ;3) a host se33 comprising 
the vector; (4); an antit^c-dy against secA polypeptide (I); '3) 
an antagonist which inhihd'-s a-:-tivity or -s-xpress i-:'n :f secA 
polypeptide (I); (b) an is.:date.:; r.-: 1 ynuc 1-.: t :.-:ie fl\d I'omf'risLng at 
I^ast 7::i: identity ti a p :d ynuc 1 e::! i de en.-H:ii.ng polypeptide 
(^^'n, a fully oePined 69? .an:.no ^cid protein given i.n the 
sf e':i f J .::at iiTi; and (7) 3 rcmputei read5il:d- medium stored wit.n data 
sele:-ted trom: (i,. secA pc I ynuc i e :.c i ie^ ill) I\-d cr polypeptides 
i^) / Cill) ; (ii) a set ■: 1 p-id ynucl ec : ides ■ji: polypeptides, where 
at 1-rEist 'ine sequence is a secA c ■: lynu::! e jt i :ie or polypeptide; 
and fiii) a data set repr-es^^nt ing sesA po. ynu ::le-: 1 1 de - or 
polypeptides . 

CSE - SecA p-jiynu:^lect ides ann polypeptides are useful for 
aiianosing susceptibility to diseases p.y detesting mutations cr 
p:dym-:rphisms in the secA gene :r analysing P:r the presence of amsunt of 
SecA polypeptide expressed m a patient snmpile (slaimed). SecA 
PCP f.robes are useful for aiagnisin^ diseases, and -an characterise the 
respcnse ■: f the infestlDus ■■.■rganism tD drags. SecA polypeptides 



;hnu 



4 '-^ 3 4 i") 1 



and rolynuoleotide. ar. als. u,:e:ul f .r r :re.nir, , f . . antagonists 

anonist. and dru.;.. a.ainst .nf.:=-::=u. tm. <■ -o rg. .u ... . . .^a.n., .i.h 

polypeptide (I) .nd -hservii..} m';.. r a .-ti-n ^ ..nd a -iv=.; -r 
xnn-^it.on ..f th-- polypeptide^funot.on^ ^ • .a -^^^ 

a<K-3t3, antag.na.t. .nd io.g3^a:-.-_;.act..^ 

...n:.^ ..q.ence^ 3.-::A .:::rivLtv, tr.e..tcre i.reating^ 
^''^•"'^ -r^'T" ' ..-^^'M , . ..n--H-.-v— - v)n-a:;. ■:[.!.:•./ '::^£ease3 including 

^^^'^^"■^"■^^ n.., .-....^13, pleural 



polypeptides and ant M-'-'nist. are ...--ul :o>: t re- . .: : n ::: - u . 
arro ia'..-d with ..cricirul ^e-A protein i'---^^ .i':: ^^V'^^/ 



9r.t \--::r a, 



:vi-..':ar'-i: * 1 r iri:i esn-ens'ly :nen : n ^; 1 1: is . Vs 



:.;e 



f ■ 



" -:)r i.d^:-5 :an ^e u^ea .1:. :en--fi:.- ^:r,rr .m . 
t r^M jp-y) usin \ zr 



; t ; ns 



infections, and 

polypeptides , 

t t 

on wounos and tody irrrdan^s to pr .'-nt r:..cr er .a 1 ^ .n . 
polypeptides and p' 
(1 1 : f erer:i. 1 a 1 sc r-'j^n 



J ^^.^j ^^.,ilr i ard ; a -.n.-t s ::ar. }::revent 

rr.a: r i;-: pr 

^ _ _ _ '.plants 10 pr 

polypeptides and pol ynucieor idJes n.y a^S-' J- 



adnr:sic.n cl bac:.er:a rr.o:ri:-: proteins, an:; ar.- u.>. t L to r use 

-J*-.',, t I', pr^^-'Z-^nt r\.-! 1" r r j_ a 1 ^n - e' . • -1^. -r^ 

L Hides n^y ai 
' "!" I'; c d s e . ■ J . 'J s 1 n ■ I ■■ c A p 1 



aaqer:t5^ for 
Ln E-T-f:P tc 



;r b . -t.u -ai '1- sue, s.^-A 



polypeptides are u.---.Ji -i ^^-^[■■[■■^ 



i-n-v irih-rr:: t ance - ■ > ^ - • ^':vid:^J thi: 



d.e somput:er 
^cation tvy 



'-■^^^^•■-'■^ ^' ^:-S:;:r :,:..^e-_... ....,r.r..-:^, and als._. useful for 

ynu-l--:-f i ana .i~ p.: 1 yr. j'- ide : ■ ) ; 'iia i: '••■!, . 
F3 c?iO?: 

Cri: BC4-Ev3F; t04-?:'>.; B'M-fM--r b-J. ■ ^ - 

Ei:.i-:i.^:-ACir:; B12-K04A4; B12-K04E; _ 
Bl-;-A01E2; BL4-rn:; EK-FO.:; E:4-r;nl; ^ 1 4 -Ln o; b i ; ..4-d03, 

»,^4-Iin4; Bl4-3u:-:; DOS-.-iJ; L-O-KGV; D05-H09; .■J.-Hii, 
nA-Hil'A; r.=05-H12Dl; D05-Hl-'£; D0:)-H14; D::'.-H1 /A... 

EFd : j-tLiiii4 

1123 ADEWEF' 4 3 OF 73 WFdX^__ COPVPIGHT -.01 DEPWEKT KIF-FKATIOIJ 1.TD 
Atl l'^'^'^-l'!63 65^^ lOoj WPIF. 

ctr.j iii'"vHC,-o.i653'r wic ci':»99-or334: . _ 

T- i;^-wdsolared .gi.r.iAl polypeptide f i o.. i't root ococ-us pn-i-umcn: - 

Ueful ir. d.agnc.is, treatment and pr-.wnt ion -^t bacter.ai infections. 

IN BUFdlHAM, M; FEDOtl, .1 C; JAWOFJt;!, f l-; r.A.LEKC.R, n, ..Ft,..., /- L, tAmtK, 

FA (^Minf 3m:-hkli:ie beecham cofp; (-.i:k) .-k:thfli:;s bfe:hah plc 

Vl^ EP oB9i:& A3 i:r^'.0107 ( * Ell Jlp diy^i^^OLV vr VF NL PT 

F: AL AT BE CH DE FF ES FI F:- ■.-.b b?-- .E iT ^. L-. Lv ..L ..f, Hb 

PO £E ?I , , 

• -A .;3:m-.43^ a 1-.90101 .:'/9:C4; AAd^' A 

;p 111773,:: A i v.,, iV.'^'-n. - /'f:- S y': 

''^'^-''t-A A'"''o'd-;'rJ^^A,A.; .-A :d:C.435A^!-A rv:..-323643f 
PFAI us rj-.0-'^'i37 9 1>.-^7 07 01 

If i -M ■"■1 3;i 3-0'.^; :! .dl'i3 5- :d . , 

yy!;:y,;_;,y yyK."A-o/.; a^.'^: ■: i-3'-'; a3ip ^31-04; co7koi4-315 

?'"c07K016d2V^A3I!03lA3^;d.i;3C:-l.^ 

^il-l::i33-53; 'r3d:A-5^^;^3"::i:3AAAAAN033-A; G06F017-30 
; G06F019-00; 3118333-' 0 

I-I cAk'AsA.', 313P,001:4^>; •3lJP.Cjl:46; CUNCl^-O-- 



FS 
FA 
MC 



polypeptide I ) ':: :[r:pr i se /' i s ■ ar lecuSt 70 
ideritiry tc-' sequence (. ), 6'<1 lirniriLi Ei-_'id^- (^ia) or (4) '-^2"^ .^a, ever their 
entii'e ien'::* h snd/Dr i;:;; e n ■ue*:! by .:i re^:: on.binant p^r^ i yrtu : l.e ? t ide comprising 
-equenc*/ (I: .?I00 op (^j 1 8 7 I bp (all sequences fully 'J-imed in thie 

^fi-ec i f b.:atd ■ )n ■ . Ab^;':^ ■b;i:Tie':i are: ! /O :.ir. i ut: e i pr) i ynv.. L-:'C!.. I'i-' (I); (B) 
-.if. ari*: Llr.i'j :iy anti'i^erdc r i Jiuii ir.':'': pe i f i c t"or polypeptide (11); 

:':) ,a pr'iC'ts.;- f'l-r :! iaqnc-s i r. :i or pr ;jn : ;:d na a ( - us :--?pt i:.d. L i t y "0 disease 
m an iridividual r-la^e'-: t -xr^ r-^-s :■ : jn :u a::tivi".y :d (II); (J) a nie*:hod 
f'l'i' S':^ T' ;erd n 1 t d i':ien':iry ■::':!:up'jur.i:i.: t.n^: a:::iv;:i*.e inrMoi^: th-- 
function ot (11); t E: ; -an a-n^^jrii i. t / arit aq-:-n i s t ot n he arti'/Lty or- 

ru-'i.du'-in ? (11' v;nen pr-:-sen: in : m:,::..;: ir I n^i^t: ; e M ; :^ 
/uiLpi isl:.-^ ai": ve ->:[.'r es s i ■: r. s ys^: em / ri.':-:T.b r'ane expr'e^ 

p r f"-::r r* r : dU'? i n':^ (TI! rr^p^ r i. s i : :uit-..:iri^ in-:- nc^5*- ^--11; (I) a 
pr'j':-es3 t'lu C'r-z^du'i' mu r n-r h'j.z^ z-^il ::o]\X'i i ^ \ r.q :ne e>;v.^r• ^s ^ i or. system for 
■II); (..ij i r*:- r :n.r i n.Lin:: n':st -re L L ^iaLiar le ex^'ressing (II); (K) a 
c':ni|:'Utej re.adard^:- n.ei'l: un v.d t n st'jr'i-ci sequer.0€:3 !l)-(4); .ani (L) a coiuputer 
b'^sed metn:-d i:r r - r f -lurd n':j ;.:m:icqy i 'ie n t i f i ■::.a : Ij: . 

bob - 'I), its a':;-jniits sr- (III are u^e:i to trear ■^•jnd i t i sms 
r'-quiriiUq iric-re aso'i activity or ^xr^ress i ct. f (I) i r t i'jns n^t cited), 
v;bile c-.-n :ii t i c^ns ( pait i..;ul.j r b/ i:ci':-: e r- i a 1 i r. t e^st i 'jns ) t e-.::ui rinLi inhibition 

£ (1) :e *roatcd oy a<.i:r i r: i h. : d n^:: an ant^ujnist, i n:aii.d. t -i-r y :iuolei<.' acid 
'i-r o tir.p'*.- 1 i : IV'.:- polypeptide e,^. prie'.:nn-n i ae infei-ticn, 



t, 



nq (II); ( H ) a 



r-: us^rd t'C' treat i:a-i.i\/ei 
vv'iMand 1 1 e-a tuuenr s t [^rei'' 
)-rel ateii 



d- 



uastiiti;:-. These .ant it a^:-: ^^r : a 1 ag^i-nLc may also 

:itEvi c>^s t': r'ie\'T:nt in f -'Ct i'" ri c^r u-anerally as 
aon-si':r: f h^U'i-teria t: iratrix 



.-no 1 1 



proteins . ( i 

■■ : ) . ( I , , ^r :'e bl rsenljrane.:. <: t i" 
sf:ar.dard rdnumq a-s^y) an^i ■:-elis 
C ) , us-d (1) :o i.^ise Al:. ( r^^r us* 



>■ z^ i -z- ,=,1^ i ti i 1 1 1" ' *■ ^ f I '* a 
y (i) u-i- 1 0'S t .i. no n,-^\:a^ bans in 
■y i':sr *: ►a rnd rd ntr |: :'=■.:'." n ar am^^unt c-t 

a r aLseo zo sci'j^:. t-ar (II; (in any 
: (E) are us^ad t C' pr':du::e ie-::(ambinant 
i n iden t i tying / i ;ac 1 at i nq ( I ) -expressing 



Ciones, f<ar affinity pnji 1 f i .:u t i 'tn, as therapeuti': agent an^-J in competitive 
drug screens); (ii) to icientify (III) 'ar specific :ecep*:rrs; (ill) in 
r.atiC'nal dru-j aesian ana (ivj as irririunsgen f cir vac-aines. (II)/ c-r its 
fragments, r.ire used as ant i sense/ ril-Mjzyme therapeutics; as pr<abes and 
printers t't iscibate h<.'m'al c^g-aus sequer;iaes; ta detect (ir.tant) (II); for 
chr'amos-ama i mappdng; t^a determine l:acterial serc'tyre; f'cr genetic 
iKJTiunisat ion; to scr»>en for (III) -ar.d in rati-anal ■■Jrug^ liesign. 
[■wg . :j / 0 
Ctl Zr'I 



t [■■I : 804-':::: 
B04-N04A 

Bi;-Au 
B I 4 - o i 
l)0 ^^-hl_ij; 
FBI : St' ;:-E1-;H4 



3; BO 4 - £!)SF; BC4-EC: ; B "4 - FO 1 1 i'E ; Bd4- Fb;B4 ; 3 j4-G07Ci3E; 
BI1-C'J7A; Bll-CiJHE:; B12-K04A4; B14-A':.l; 
B2; B14-E08; &14-E10B; B14-H01; Bb-l-LOl; BI4-L-J6; B14-N17B; 
; LO':-':i../ ['05-H'j7; D05-H09; CMay_HII; bd^.-HlsA; 
::):-}^lBE; L-Of-Hb;; B':5-H17A 
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1 u 09-0 
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4 baa [Uaj 
..■5 04 I 
bete rmi na t i an ■: f 

s^-quen-:^*"- cam:: h i. e ra r or. 1 a a 1 ^irpu aa : 
s*:ructures de-areases at each st.age. 
B'a4 DHi SO^ TOl 
r.iEBE, [) A; d'ODDABC^ W A 
.a'ALY) '"^Abl F'OBNIA INbT 'jF TEa'HNOLOb' 



[iNC bl OOO-adObar 
structure a>f protein - fromi amina a aid 



I ri v;r.raan n'^mr^er 



aaniiidat t: 



W'O 084 8^70 Al 199010^0 flcOOCO)"* EN 72;a a01Nr}3--:iO 

F.W: AT BE oH CY DF DK EA Fb FI Fb dB ib! ab4 db IF IT BE bS bd MC bW NL 

a)A BT SD SE SE IV: ZW 

W: AL AM AT Ad Ah BA BB BC BR BY CA CH CN CV CL BE VK EE ES FI GB GE 

GH GM GW HU ID IL IS bP P:E P:G KB KF. KZ LC Lb LF. LS LT LU bV Mb MG 



:;h ju 



Mr' r:N MW M>: N^:- PL F'T R-' RU S[> SE SG SI SP: SL TJ TM TR TT UA UG 

•i:: \':} vu ::w 

EP ■.TT'V-G. Al J0')::'0:GG^ (20'";1.G en GiV.N0:G3-Cnj 

R: GH FP G13 Gl 

ART VJG ■■^^^4:- ''^'i Al Wi'» i ^-i i - -i ]. ; i- 1 . ■' 1. ' ^ 9 ;■■ J 4 1 ; AG .'*f:;7l46b h AG 19Jo-/14b€ 

;LV:^h-4"-- ^77-3--) Al Ei' 1 M 9 ?; - 9 G3 1998'!4::l, WO 1 ? :i8 -UGH f !7 7 19980421 

POT Am'-.)xM4';^/'a Eas-G -i -MtE-^^O; ET 97793':. Al Ba3..d :)ri WO 984 9270 

PF;AI UG 19';'>-:-4 4 1 24 : 9 G":(i4_ii; i.lG :3'*7--i4 :24 :.997 J422 

I-:: '7- IIJ;.' A:- :GJ 

I ':G G06F017-00 

Wv) ;i-4.3... A G?AE: 13:^^il22 

tEy..ie ■ i nr.-.;.: si on.il G .5 - ru ::-:ure jL .^n n protein is 
9^^^^,,.,^ -I Yy; sel^icLinq rini*:-r se". of. t:or:s:.on aiKiles and <geno2:ating an 
e:K-5^-inrV/>:A '■:r)nforKu r-p r-.^ont an -xnaustive ^-nuneraM.:n ot 

se i i: --!'■■'■"' 1- 1 1 '-G t-'a-r, r:E ■ Hit-^ .■ .m i. j. .a-a i. .... ^ -^^ ^ --^ ^ ...^.^ . , ^ ^ . 
GGE - Tr.e method i.^' 'iso-i f-i^r dete rm^n:.n j a protein's 

■ , - 1 .^,..,-,4-,.,..- ^ >— i ^ -i T-v.-, ;^.-'iG s ^u^" t" urf^ . rh i -^ structure 

is .■■)ft^--ri t-'n^ [:ey tc o 1 u ida 1 1 n the function and mechanism jf 
the'protem .G: interest, aUcwin:;; >.,searchers to discove- n-w 

more c-:/tent orugs. 

GL ' Gtr]';'A-";-^ „ :a>=. ^ -uG crjvidto: an acoursite ind easy to u.:-e method 

for' .^L m.^i: structure pr e-:ii i:t i :-n . 

Cu'o . " / '"' 
ES CP] EPI 
EA AB; G: 

MC C?l: B04-N04; P11-':'G?; B12-K04E; D05-H09; 

Go 5 -in 8 A 
E?l: G'.;3-E14; TOl-G 

L123 AM9WEP 4: o)E 7? Wll>: ■G.'iY?I9ET ^^JJl L EEWENT IGrGRMATlON LJL3 

Ag'^^ 199 9-9A3:gM ;021 wpix 

EGN K1999-0 14 ::56 C'GG G I 999- 00977 

T: E'etermmat ion of protein t.i ol ogi cal function - 

comprises use of amino acid sequences dataoase containing the relevant 

inf cc;ma t i . 
l-C P04 TOl 

lA (rVAK-Il) lYAlGJ BUNGKI GEP;KEI iGEGKYGSHO P:K 
CYG 1 

PI G? 10287096 A 19981027 (199902)* 1 Ij. C07K001-00 <-- 

AC'T JP 10287696 A JP i:'97-^2977 1 99^04 11 

PPAI G? 1997-99577 19970411 

IC IGM C07K001-00 
IGG G06F017-30 

AB OP 10287696 A UPAB: 19990113 

A -la^abase C'^ntamin? the mrormat ion for amino acid sequence of 
protein with at least 1 l:ici:.gi.:3i function with addr^o a 

score on importanco of expression oi the hi<Glogicai information tor each 
ammo acid i^-sidue, is new. , ^ . ^ 

UGE - Gne dataiias^ is useful for determination ^;^f unknown biological 
function of a protein or. polypeptide basea on 

ni.- h-.moGtgy of amine. .:icid s-quenc-:., e.g. steric structure of 
protein, .^nd inclu::^:S rerrieva^ an:: evaluation of hi jh homologous 
reGjf w.rc^r ip for the determination cG: mostly resembling protein. 

AL'VAGTAGE - The dat/.base allov;s for rorrect and rapid retrieval and 
P^resump t i en of protein and polypeptide having 
biZ'l'tgical functions. 
: wg . I /G 
EG 'TG EF: 

MC "^PI: BiOi-;:;:!; B04-IJ72; B11-G08; B12-K04A 
EPI : Gc 1--9^0B 

L123 AGGWEE 4 '9E 73 WPIX G'JPYRIGHT 29)1 DEF,.WENT INEORMATION LTD 

AN 1991^-4^7 4 99 [37] WPIG 

DNN N199b-34U768 FNG 01998-133088 



"i L FL^ediction of functional site jt protein m ^De:jie3 - 
ij.;e i :n^-iove protein functionality by mut c: t i :)n 
i .lent 1 f i* ■ 1 functional sit'j. 

DC E:ii Clio TOi 

IN C'.):, H; }-::PAKI, H; KAUAI, A 

PA (['0:H-I) H; 'KA"-A-:J) r:ACAP:" :^IjnToU oHINP'-^^ ■:ic;y:)[)am; (KANA-D KANAI A; 

(MiPA-i) ;iiFAP:i 11; ;Nij->i:) JAiAN S':'i & ■rE-:HN^Lr.:;v r-ORP 

CYC 2 1 

PI vv:.- ^8^:"^9f" Ai :--:^ify")^> ;iy^^^^7)^ ja 'I'-p :] mp-o^^ 

PW: AT PE C'H DP PP PP PI PF. .:-p CP, IE IT LP Mr PT CP 

W: CG PS 

C: P:PPCi''C:^ A \ 1 -^i: AA .AA Jii-''^ 

AAC.A-:. A A- • • :■ -API 'A.C^-A. : :.c A--FA17-iA. 

CP pialp^'t a A.'^'iprip 2;;p :i::p}i^-is^ 

liP] A A'PP'PA ;I-v^P-;_' ].;p ^G' PiA7-:-C- <:-- 

Ei' l''iA:;"'A.' Al '7:pP(.--":. p ' CI ,C\:)i5 -!A^ 

F. : CH PE EP. i:^P PI iiP 
APT W'.) ^y3C90m Al W:.) AA:A-.:p-AA; :1y!P:PPp2; JE AjPl'A^n? A JP 1997-13248 
iPPCPllI; CP ICAP^Pp*; A jF ]C^A^-A:i^l9 13'A':il31; CP LI1E;)P7'; A JP 
IP97-C32100 1997 12IA;; CP i:^:PC;:a a jp 1 <-i9P- Ic C99 :9:-^801P0; E? 1013759 Al 
El-' 199P-9C1095 19^^p!.)Pr)j, WP' 199P-CFA7l:i lAr'A^ppp 
El^T EP i.A37p'>t Ai B^i^ecJ VP!' ;^A-P:P-'P> 

PFAI CF 1 '^^'-P- 1 P t;. 9 9 A:^ v :■ (; 1 'j ; ^7P A-;' '-A - ^APA ^ PP^OIA; CP A^97-1 924 9 

1 P^PvAPP; J? l^fHC- A I P'.: 1 ^ A, _ (A" 
IC [ :M G06F017-00; GC6F017-30 

ICS C07K014-00; CI 2P(A:' 1 >A PA ^ i :P ':-9- P 2 ; 'APIPAA-S'.; C12PC::i-(A; 
ClPCOPl-bP; G06F019-00 
ICA C!P/FP A 4-1 '^P ; (A 7 IC.iI : -0'>; CA^p A 9-C 

I CI C A-iEI P9 : 0( A CIPIP A P - P9 , Cl-^PA'.A. : 1 ClPnC'-"^ *- 7 '"C CI PPnOl : 1 ? 

AB VV: 98 J 3 POO A I'FAP: IvPifOPlP 

Functional si^te c:-P <-i "pc-:e;) protein m -a sp-;Aos A.ir 

wPPcri genomic data c.-l e;-:T: ens i -.^e 'Ali/i ariaJvoLS :lat:a ^6 a \'a liable, 

IS f uecict ed by : 

(a) calcuiat inq the occurience frequency of each oligopeptAde of a 
;sf.ecific length withm the total protein sequences of the 

species ar:d denernininq the lengnh (n; f c r whi':Ai: 

(1) there are tewer ol igop^ept ides ot length {n: occurring once than 
tvAce and 

fii) there aie niore o 1 i^j i^'erA i des of len-:|th (r.-l) occurring once than 
tv;ice ; 

(b) examining the sequen^-e ot the particular protein of 

interest ; total length L) and deteimining tne 'jccurrence frequency in the 
total protein sequences of the species of: 

(i') oligof.-ept loes f Ai ) of lengtn (n-C) c^.n:.aining amino acid residue 
J from the M-terminus cf the protein (v;here (n-l) .A least j at 
mc-st (P-n)), this r^esidue ;i t-:Ang at position i fr.:ri the N-terminal of the 
oligopeptide (where 1 at least 1 at most (n+lj); 

(ii'l oligopiept ides (Pi) of lengtn (n-l) .:leri\^ei from these by 
oubstituticn cf anAno acid residue i in ■: I ig-jr^-p: i'l-^^ Aj by ancAher, 
arbitrary residue; 

(c) calcuPatin.-j the rati.j (Pji) jf the t'ccu r-rer:ce fr'-quency cf Aj and 

(d) calculatin:; 3; me ki v-al^e (Vj } cf Vj L cv-r i by the fcrmula Yj = [ 
V:)i] / (n + 1) ; 

(e) caP:^ulatinq the valu-j :A a function Ci cA Y:; the 
function t^emg a c : ri t ir.uc' as 1 y iri;reasing r ie:rcasirig 
function (a preterreo function is Pj - -i:g(Y7.A and 

(f^ repeating steps (o) tD -e) f:r ea::h v.ilue :i j frcm j=(n+l) to 
J- (L-n) . 

Tne va^ue cf is an iniio-^tC'r jf tne de^-jree c^f contributicn of 
ammo aci:i resioue j tC' tne functionality jf tne protein 

. 2j may c-e; pAotted is a d i s t r it-u ti c n cur-e (number of residues with a 
particular value cf Zj, against "alue cf Zj] and the peak value and 
standard Asviation ^a I -cul at e a . Tnis gives a measure cA vvhi cn amino acid 
residues ^tntribute n.3st tc tne functionality --^f the 
protein (i.e. have Zj itore than one standard deviation abc ve the 



mean) , ani tr^jin this mt ormat i :)n the functional site or sites 
cm be pre iic* ed . 

Also claimed are: 

(1) an ariparatus (e.g. C'jmputer systeias) for automating this process; 



ymes (sequences given m 
yro::o':cus f jriosus DNA 



(^} *:hre- heat-resistant jNA synthetase enr 
rfie sp^'ci f :. sa" L'jn) obtained by m. Kii f ica t i ori of \ 
synthetase at its functxonal sites moing tfie inrormation 
■)L)tained by this process; 

; ) D'lA cooiing for ( .'^ ) ; 

: -i ) ve::t'':'rs connainLng : ] ; 

I-.] t r" ans f 'jrmiint Escnerichii ::oAi coritairiing (4) (E.coli 
HMS17- . OE 3 . ^poP 320 ;r'ERM ,-l^y.-l2., E.coli liM:^ ^ 7 -; ; DE ^ ) /pDPb o 1 "/ ( FERM 
EF^-blfc-) anl Phcoli WMSll A [Dr\ /y)D?lC'l (FEF^M Bt~61'i'))r ana 

■t") a preparation of tne heat-stable f^n::ymes h\' culture of the 
t i-ans t 'j'rmants . 

U.j'E - Tne prO'::ess an-.i the nateiiials ^re used f'U- the predictio 
the functxonal sit^.^s of a protein fr'jm genomic 
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accessibility or protein side ihains - then iormin'g and refining 

iri'idel structures fr-om conf <jrmat ional energies, usei:i e.g. in phiarmaceut ical 

develor-ment . 

DC BO 4 DIO 3 0i TOl 

IM IIC)ELT1!JG, B 

I'h (bO)Eb-I) lltJEbTIMG B 

CYC 1 

PI DE 197562iri4 Al 19980716 (I99.-34)^ Pp '.^0 lb'C3 3-68 

APT DE 19756284 Al DE 1997-197 5 6284 19971211 
PPAI DE 1997-19756284 19971211 
IC I CM GO 1110 3 3- 68 

ICE G 0 1 N 0 2 7 - 6 2 ; (3 0 1 N 0 3 3 - 5 0 ; G 0 6 FO 1 7 - 5 0 
AB DE 19756284 A UPAB: 19980826 

The sp^atial structure of a protein (I) c-f l-iniwn prir:iary 

structure is established h^y: 

(a) det ermininvT the surface a r. sessibil i t y [SA] of protein 

side groups; 

(]:•) det ermi nin'j c-^^n f o ^~ma t i oris that are ic-mfiat it.' Le with SA and 
prrysically p^issible, and 

(r) f.irmmg an^i refining a st ructural mi-ael . 

2A is examineo t. y itiutrigenesa s ario step ( c) invc'l\'tB'S analysis of 
conf itrmati'inal ener gi es . 

DEE - Che metn'id is used in molecular tiic2':i(gy, meciicine, 
L'i'jt e'::hnol'jgy , genetics aria p^narniacy, e.g. t-i scudy irit eract ic^ns oetween 
enzymes and their substrates or inhitvitors, 'ir betv^/een proteins. 
I'he defined structures can help t 'i understanii the effniccs of kdomo lecules 
and tC' develcp pharmaceuticals that niiulate the function and 
f'rop'erties biomc lecuies . 

ADVANTAGE - The method can be used where usual methods (nuclear 
magnetic rescTiance or X-ray diffraction) are not su:. tab'le; it is cased on 
the fact that, ap)art frcin free er^ergy and partial vc. lume, SA is an 
impC'rtant 'ro-crdinate in structure determination. Inclu(iing SA reduces the 
numtb'er of P'hysically acceptable drnf ormat ions so much that coniputer 
m^odelling b-accmes practicable, Gcicd qualit>' structures are prc'duced 



'..^ui::ki'/ withi tev; mater i.ai r equi re-Tient s . 
r.wy. J 

AB 

Lv;4-L:a; B04-N04; Pll-COB; B12-K04; 
D05-H09 

Zl-I: J-^-E:-); S-J3-E14H; T-'H -Jl S 

48 7 ■■ V;PIX CC'PYI^.I fjIIT 2001 DIRWENT I KFOEMATIOM LTD 

I 9 4 4 [ 0 ''^ j W?IX 

DMN Ml 9 ;^::-jS4 1' 'J^ 

TI .^r i-^ ;* '..j: ^ alignment rr[*r*'r.':d to: cor.par at i 'v"e uiialy^i^^i oi 

protein ..c*.; :res - i.,3ir\ :; ::l:L-;..le '..iyr.arriic crcqran-^r". c:. 1 gc^i- i a^Ti . 

LO' TOl 

PA 1 r I ("'[-l-I ] ■ :i X' M(")LECULAR E'\('' IIIPT; 'oEIB-M) SEIB^uTSa PUNSHI P'OGAKU 

1- 'ENr'V-OHt) KP 
CYC ^: 

PI E: -4 ^*P.:C- A2 1 ?9-" CaX: 4 ( 1. 5^ 5^ 3 2 'r^ ) ^ EN 4 Op C0 6E017-?l <-- 

P: AL AT BE (OH DH: 'S'K ES EI EE ("^R GR IE IT LI LT LU LV MC ME ML PT RO 

3E ;;i 

ri- i-:.]p^.:\p. A 1 -^-L-o 1 4 (i..:^-o:;'b) oip ^^ci ijco -.-f,:: 

'ip r 1 ^ ', -1 : : A 20' -^it ■>.;■ t: (2';';^C:1) Ci'A) EC I C 0 

APT Er y^-^^'-y. EP L3';^':'- ^ 5 i:^;oi2E0; jp 10100920 a op io-';-04:p27 

: i'r. 1 . ;■. 0 ; P.;. ':C2: 30A A i-0 1 Y-w - m'^2 A7 6 l^^^'/lpl'/ 

pEAi CP ^;^■:^^:-04"■:'2'' lo i^oij 

RE? rP: -PP.. r-iP' 

K: E;M iCCN^i r ; G06F017-30; G06F019-00 

EA^ '-'A IP' ^ 1-p:'^; G'j.vr-: -00 

A:_: E: ■.■■1 P-. Or An: i C L- / 

Trvi; vu^:- 1. 1 a..i. alignment niePr.'-'-i involves using a dcui:Ac- a;/narTu 

ij.r r^-.r anni:-.'.: a ^ g'j.r: i t :in; . Elitanc*:: (rap-'tff ap pi-oxirnat ion is p^i r rearmed, In 

'wi-i.^cn a sfjn^-re ha'/i.:;c: a f^i edet ei-inlned radius and c:entred c n tne side chain 

'jI a lesi'iae of a protein is ci^i'fined, and residues with side 

■::hains cerA. res that are |:>resent within the sp^here are selected as 

■":-:-nst ■ : uer:' el^■rTLent.s c f a stra'::tural environment of zhe. residue. 

A cun-^.^ff ap-p:r-'jMim.:i t i 'jn i :^ p^er formed f'^r selectively com]oaring 
r-'-sidue pe..n!S ■".O-jt <ainec. h '/ the i:listan::e ■:ut-Ldf ap'ijroximat i cti and having 
-imip-ir l:--.;iP -nvii-^'nment s . 

UCE - 0~'ji:Lpar at^ ve analysis c-f protein structures t -tpitain 
i iA~-ririat i-LUi re':;ar din<.7 structure, function and evolution 
'J t proteins . 

AL'VAE'TAGE - En.jpdes stru<?turai alignment to be constructed 
in reduce^a periC'd oi trme. 
Dwq. / ':l 

EC ef: 

EA AP; i:;i 

Mo: EPi: Ti;)i-..no:4 

L123 A'JPWEP. 40 OiE OX WPi:<: CO'PYPdi:;ET 2001 ['EPAjENT I EE0)RMAT I Ci: LTD 

AM 1 ■;^M;-;-y721 .^'.i [24 ) WPIE 

DMN MIOOl-tl A^ rOJC ^; 1 ^ d - 0 ^ 4 ^ 7 I 

TI E'/al i.r.u str-MCtur'- proteins icy siass i P i cat i-sr: of cod'^ns m 

(::-:'r r'^-sp'Dr,- 1 1 r.^ nuclei-" -j :-ids - u;='.ed to identify inifiortant -m- mutation-prone 

2- gi:ris, functionally similar proteins or ceding 
r^;rgi s-ns . 

DC Ep:i4 pa. T;-4 

IM ERECCO, 0 E; HALITCEY, P 

PA :UMC-!I) ":tMULATIVE IMti'lIF.Y IMC 

CYC 7" 

Pl VJO 0PloHl4 Al 19-^-0^ w (199324)^ EM ^ 6p C:C:7K001-OJ 

P.VJ: AT BE CH DE fiR EA EC EI EP. GB GH OP. IE IT ?:E PC PC MC MPJ ML OA PT 

W: AL AM AT AU AC BA EB BO BP. BY CA CP CM CZ DE PR EE ES EI GB GE GH 
HE ID IE IS OP F:E poo r:R KP. KZ EC LP; LR LS LT LC LV MP MG MK MM MW 
MX MO NZ F'L ET Rt PCI SD CE CG SI GE SL TJ TO: TE TT CA CG EC CZ CM 



vu r.w 

AM H^fsn^^l A l-^'^^'0^22 (1:^;^3'U:/ ; 7 i-"', ... 1 - - i ; ■ -- 

ADT W' - 9H1:--'M Al WO 1 ;V/ L 3 1:^^^ 7 1 027 ; ^^4^0^.n.l /: AW r.A.n' - 7 0' ]. 

I''97 1i'j7 

r DT A" '-7'. A Bace^J or. V/'J ■;^7l&81-] 

i KAI U.; 1 1 I7-70^:7; U3 l:^:-.:-2^72: l-A^tlU2c 

IC ^'M C07K001-00 

r"::: G06F017-50 
AB ' " 1 " 8 1-1 A UFA& : 1 ^ -'o iJ 1 1 

Sv ru':"n u:'^- i" r a protein ( :i ) is evalua7ed by: 

■1) a;-\--i qn.ina ta3-;r i:; the r/:rl.':ii(: ac:i.ci ^T^quF-rir-e (IT) ^-TKrod i [J] 

i; (J r r ij: i ■. s , arid 

■ .i.-signina th--.:^ tripl'^-^^^ :atr^) i c.=.,r of ,-■ cinary rhcice 

a ; i:7\.;ib'-. * o: n deareo:^ .t reedc^n, tn-?:.?f:- classa.:- beir.a generated by 

a} }j1 y.- n ::dnar>' •:\.':i.:.-r rame*: e.r s (v/hiab ar-r functions -jf a 

A:s':- aiainard aie: 

■■ 1 i a ■::la55 -tic-risrarit tat'ie cf r:ea::ert ne i ahbc'iji' fNN) r6:iat i onshipc 
f'.r ai'i.irj.- uaid (aa- residues that prt'V::des, ft^r each class c^f 
c ■ asa-cor;;-^" ant KM, a frequency of cccarrence tor each IIN p^air an a 
C'jI 1 ec:t 1 '"'H '"'f at lf-ci;?t 10 proteins; 

A. ^ (J ]';ach i ne- r eadah'l C' n-ediun. c^n wh.:ch tn*^se tai:'i^:;'S ^r^' stored, and 

' '■ ; ..X i;.etnO'i tc^r ^ Vn 1 a.iit inu a protein: 

■1 r^-s 1 ;5c: anc'"; ^.o evolutionary ot lua: .lit .l«jMo i ;di(^nae by 

1 nv- :•/: na reaitais en::t'ded Ly runs :f a single sabclass, or 

'ii) t '.'r p'r^sen'i'e ol c.ritii.03l a-j Ijy Identifyina rumor it. y codiDns in 
r ans l*: ■o-.ri.-.ns o: n sin 710 cA-^ss or foh'crcie. 
- 7 he iiarth", o ^ .:■ a.:>^^;: 
, t' 1 ti-n*: I : y functionally -^na • r '.ctur :::1 ^y : :r.pt;r i .i.nt 
r • ..: f 1 ) ; 

tt: : tittt :::t oo^iii.o r>o'yi^tni; 
• . J. i ; :oa id-rFitOily st i a ar <:1 an^;i functional sinAlori* ie:S ir. 
proteins / .ii:d 

(ivj ^o identity mut at i':.>n-}:'roine reaiC'ns of viral (II). 

Pa rt i^'-ularl y it is usetl ft-r desia^ri ot new proteins witii 
s-lerted s'; ra ::tura.l sr functional rr:'perties. estjecially 
;: loltigica^ activity. 

AjVAIJTAi^?. - The method can iden:ify structural honiol^igies between 
proteins ev-^-n wnere sequont:e hctjidsgy is low. 

f;;; a' si E?i 

FA Ab 

M(7 CF'I: BC4-EU1; B04-N04; B11-C07; B12-K04; 
DO 5 -HO 9; DO'^.-:ll;:A 
Fbl : Tl -1 - Alt: 3 

Id 23 A[]dW?:R o:j np 77 VJ2i:-: CCiPVFAGHT _001 [lERWENT INFCiRMATbsN LTD 
AN r^9a-S07 7 22 [1-] WPIX 
L'NC C199:::-0a7 4a:^ 

T: oIIA o:i-:oji\:i^ sa 1 :d.. ..n- i Jt 1 va t c d it stass itin channel - useful, in assays to 

iler.tify tjnpoun'is which increase ■: r d*:o:r'ease pot^ssiun 1 coi flux. 
L117 Bo 4 I'll) 

Ih AAFLMA:!, ; ?; BON::), O T; MA7'LIF, J; 2I2VIA, C P 

PA (:CAa-!I) r:"'AGFr] 1N7; (MVtiP.-rJ) UdlV i'FEilOM HEALTH 2C1 ; (ICAa-N) ICA-GEN INC 
CYC Sh 

PI W' ' ae. 171 }s;^ Al laae::)31^ (1^'-E17)^ EN 121p C ' 7 rai 1 4 - 7 7 2 <-- 

PW: AT BE CH DE [T: EA ES FT FF 7-& aP IE IT FE LS LU MC MW NL OA PT 
2 J 2E 2 2 UG 2W 

W: A2 AM AT AU A2 BA B3 FG BP. BY 2A CH CM 27) 2E Di: EE ES FI G3 GE 

2ii Hd ID IL 12 JP P:E ?72 FP r:F. Y:L h2 LP: LR LS LT Lb LV MD M2 MK MN 
MW MX NO t:Z PL PT F.O Rd 27' 2E SG SI SF SL TJ TM TR TT NA UG US UZ 
121 YU ZW 

Ad 9'^-;2 2^'C) A i:i^^2j4 0s [L'^'^h 7 J 7 rd^ 1 4 -7 CC <-- 

EP 943S42 Al liJ^di013 {l':Gdi47) Eti 727rX: 14 -7(d) <-- 

R: AT BE CH DE DK ES FI F?. GB :;R IE IT LI LU MC NL PT SE 
UP 2000214 310 W 20''01031 (2!j0029) 126p C12N015-00 



FLiT 



/\r T 7 ]_ Q R 9 - ■ ' J 1 1 0 ! 2 2 ? J 



FA 



i 9 7 C/U D ; A U '^74260 j A A U 1 7-42 ( 6 f j 
■1 4 1 Aj V 1 9 7 1 ' J , VV' I J 1 - ^•'^)"2 ':• 1 9 :^ 7 0 910; 
J^' 2-:'U05 14 7.10 W W<:) A;^ ?>7-i72 1 " l:^:^'7rn7), JP 1990-0L7028 l^-A'7:AiAj; AU 
7^0I'.o B AU l99'-4.:^jn0 1:^9 7 'J '^lu 



a:; 9 74 2*: 
2 : - 0 ■) 0 1 4 



r. W(9 
-J417; 



^.9 A B.i.-e':i 

B.-^se-:! or, WO^ 9 - 1 : 1 ? 
■v: 1 ;^9'^-4':_3 ? I 
1 9 9 7 0/ 

AA4 C07K014-705; 0'l..MJl:-09 
A'.;; C07K016-00; C07K016-28; 

(:a^:a 'M-.;. 1 ; oaonav-i 



7-0:; G06F017-30 
■ i\ ■ ■ PAF : i J r 1 1 



1 : 1 

1 : 1 



Al B,i3e:i 



^; AM ;A 



E Pr»-vious Puc2. . A7i :^7 4 2(A.O, 
1 9 6 7' ri i i ; 71 7; 1. 7' 7-4 0 ''i ': 2 



1 



' i L^-. ■ , : J - i 



J 1. . 



P': A 



'.;i:a:.r.el (A), v/li-re f.n^ rrjM^'i'm^r r- : (i) h-'i^ a l.ited rtfjl O'.-mI ai' v;ei'ght of 

L-^'": W'Vhir 40 ,iria Aj k7'.-i; [ J 1 ) ha^ a unit Cfjnauct anc-e 'j1 j:i^;!tweG-n 7 a:\d 00 pS 
wrier: th^::.- rric.r.C'mei 1;:. m Ohe functional p-:- 1 ymer 1 c f.^riri C'f a 
[; ^'^t a^c: um 'rha i n an'J is exp'i ^-.^i; ed ir: a :7eri.-;.f:'US i:-:L-yte; and flii; 

^t:'n(:- aqa.lr..:t th.e- AjllcwiKg .:eque-nce, 
11'/ ai:.?er." : 2.rA7._jF2'rPAEi7H00Tl^ F2ii4L''r70jT?' ( R ) 
I'd f at. . 2A in^e':: are: (1) ar. 

]t-a?" A^ '.-orLt i a a'jai-- ard ra:: ar"i::;:- iron. 
; ar.'i CL?r-a >= r va 1 1 ve I y rrL^idified va c larits, 

['.■:je-; (7 I .u.:. l.-Ela.'iaci nacA^Ac a. :■ i r; .-f at 
A:'aliy nyiji i r.i7 sea , u:.d-;r- a t ra. r.aer^ 

- -r..r; (2; A=ol.aOa(.i 

t" 9 seLf^.-nces, 
t ^ :^pec I f ically 
3:"it ilj'jdy reacti ve 



arecifically l:viTida t ar.-: ir 
ivaer e Or.e 0'- 1 t-rrn i aa ^ P i ;;■ ■: 
Ail ae-q-j/.^nae s ate- rii iri 

i. I a t rj^aleA;- 5':'id '. a^-^aJl 

AA , wa:L'-:a i\aa ■ w-r '.a 



. ^ a^"■ r: r 'jv i ;: 



:: . rjii 1. Lr:a, t ^ r 
a:':i:: : a-r^i-:; a a p^e: 



a - 



'>r { A i wi"; i :a\ a^ s a*:- 

r 1 ■TU'jaa ainin': a^'ids frr'ni oxs: 
*:ne Lr ar; rv^t rv :A y r.c : ie^i \'.:;r iaa:a, wa-ri e tae v,:.r 1 
i^-aat, araier i iroTiun'Td '-iqi -I'a 1 1 y reactive a^a"idi t i -jita v/ltn 



*a-.' aTie cf the 9 proteins; (4) .^int ib<j'-:iy arieLA f 1 ca 11 y rea';:tive 

t'!' (A) whiah haa on- ^A ^ a a que a-::^ a ; {0) an exp'resaAia veotcr ca-mp'r i ai nq a 

:aai'a ar-j': :i L n': :Aj aa da.:: ^.r itiOd a;:;!'/--; (oi a ri-jar ::^;'li aranateated 
wit h t \i e ve a. t r ot ( 0 ; ; 

Mar: Tne ant iha-ay ana f:'r-::l'e aan l:>e uaa.d to dete^-t tne preaonco a^f (A) 
oi. (I) in a samrd.e. H->a^ oa-l^a axr-reasAtn (A) ^^an oa '..s^-'-d In a a says to 
I'iontity ■rC'iup^junds whioli iricr-eaao or ieareaae th'=^ paajaaaiani ie'n flax 
tnr-auqh (A). Th-r t r-ans f-ct o-i h.jat aell :an alao h>e ^^aed f::a ana 
r ^■::r^xl: vnant pre Ju?^ ion of (A;. Tne nnzleio acial .eeqi.-n::aa ancc'dina (A) can 
alS': k-e used for determine niatatAan ain tne 2P: and I?: qen-^s in a canir^uaer 
system. The proteins enC'tdovi by the 3K and lA genea can r^e used 
ir. a ':a:>rTX'Ut er syst-ar. ft^r detvrndninq trielr three diiT^ena i'^nal strui::ture, 
w.nioh is useful fa^r d-- 1 erird n i r-:T liqands tt.at bind tC" the proteins 
. (All ci aimo':!) . 
['wq. 9/ 9 
CFd 
AP 

OAd : &:i4-F9I; B-'4-F'.d ; BO4-0Aa; D05-H09; ['OO-Hll; DO — H12A; 
a90-H12:d ; 20 -iiUF; 7) A:.-H14 



L123 AtJSWFP 01 OF 7 7 WPA7 A'OPVPJtlHT 200 1 [iFRWFMT INFARMATA9N LTD 

AN a^';^^-a;079:^ [lo; vjpa-: 

DMC C19 A7-04 774 3 

TI Identifyinq 'jptimal m:iletular structures k>y variaole basis Monte 7:arla 

methoi - pfirt i ::ular 7y to deterniine bi al-iqical ly s::tive c :ai format i^an ( s ) of 
r.K' .1 r--at r aphin :ijmairis i.nvaivei in i;ecept-tr binding ana suL-sequent 
evolution ot active Liganda. 

DC BO 4 TOl 

7N RAOPFDLE, P A; P0S2, '3 M; 2HAM9V2KY, 1 L; WFAVEK, B Y 
PA (T07)H) MNIV AMEBNS Ki:]-0ST7)M 
CYC 7- 

PI WO 9cA604& A^ 19a8(i212 (199813)- EN lOC'p 



ZHOU - ^ A'mOl 



F.W: AT BE CH DE DK EA ES FI EE i';H GR IE IT EE U) M^" MW NL PT 

^■V EE 3E ';g ZW 

W: AL AM A? A'J AZ BA E3 BG BE z ":' GA GH ^G-: 'G ) ■:Z [;E EE EE EG EI L-B GE 

(:;H HU IE IG GF ?'E E(a EE KE P'Z EG LE ZE EG E'l EG EV F-1E MZ MK F'N MW 



MX NO IE: EE EG r < i F :J GL' GE 
V'l ZW 

AG 97^0 17-1 A n--ol^.:Z {I>'-S.:-) 

GG K-Q-:. n M A .> g:'\!0.\'Z {z :00i 7) 

7.ET WZ' '^^cGOZ-E- AZ VjG 1. :'7 -(ZZG] Z E^97Z7 
•a: »G_:-;ii 14 A ZJG r:^ V^--^i 14-^6 1ZE77-7 
EET AZ' ■E/-:Zr/-l A 3,i.:eG c n V;(> '--OEiG]^ 

v^Rhz GB 1 ;i 1.;.- I i ■ . I ^ r -eg: - 1 

EG E:M G06F017-50; G06F019-00 
E:G C07K014-00; G06F017-00 

AB IG: ^E:"G;-3Z a '.'IAB: 1 '■e-' 7 3 ^ ■; 

Z' ::r.: ar.G^:; Mc-iv^t:- Zl:. r E: (Z'BI^G) 

. oer";: 1 : y i riG -i- ^p'Mn.il n.ciec^Ior 
Mirk'jv .::huin with .ir. ir. :tiai h 



G[ GE GE TJ TM TR TT ZIA G'Z UG IZi VE 



ZZ;'^}E ; 1 9 - 0 V < '- 

; AZl Z7:;.:I7-1 A AZI ]Ai9Z-36I74 1^/70-^-71; 



I u ."tar* 



:nipri..; es ( a . 



■ raviai-'g a 
-7.^:ri.:.:.:I ea 



G*;£iriina' a a t. r .1 :::r la re ; ( i:* ; t r.^n^- i ar i rr f , i: y .ir: ani'iunt rand'MLl y cZiCsan witEin 

a pi eJet ermi r.eG rariGa, ^heae variat^ea <-ilcr:g h^^ais ve':zor (BV) '„,o 

pioauc^ a aev; atrU'::tara; {aj c;.-^ I ^rul .it i r^g tha j:>':g er.t. i.^ 1 '-n^i-ray (EE) L-f "his 

aTaa^ai-e; (A) 'le ::; iG i n-; , n " h^: ::a:::a r a " ra^ a re - >;i^G"'^-"'~'^*='-" " 

i: r ar.a 1 t ic ri function, wE.th>i'i C'l r'i'jt t a ^-t tr. i :;- s : ra';-: ^la: e , 

..ar.a i: it ia a ;^C}:Gea -^..ling it : r'-}:G,=va^ 'rr^^ ^>:ia-tiaa .: t ratG.u r c ; 

: eijea* .: r.a (::■ -^'::;), t^-B^::. iiue witr. a aev, BZ' toi a ape::i£Ge<:i aanu^er' at 



eiaf.-e : at ur e .:icc-: i'^i.n'^ n 
; ^1 .jr. a (rG iiritii 

(j) r- ■■'a;!*: .i aa ^.ri'.- ;:'at a-: a £ t-rn.a t a 

aistr.i:Mti:n c-v store-;! --t 1: a ar ■ 
•'•nergy valleys a PE hyc-ei-sui f a 



la) JT'E L'^a^ 1 1 a g 
r a r' r^; sale ft ea aantber 
f -.re.: ; (G ; r er^t-a^: irr:j 



r ep'et 1 1 i t as -^t 5i p^rese ! -*at ^e-G ajt-^er- ^: errit:'e r.i^: ar*:- ; ( f ) tieE-reasLag 

a or eae 1 e^tt a<:"ai i rr:? .a::heGale; la) 
:r. r •■; }:-£■ t i L 1 1 :"i aavAa';; a aew rG 

p re-let e riT'i a-E-:: aa]a.j:>-;'r ta a " r at:t rt' a ;a-:-a'. ^a-r-ed; 

1 1 1,3 h' 1 .j: '.■at'ra I hlj 3 

=.t' aaae 1 e r a t i r:g C'ja fG'rina t i 'ta a 1 
mt'tic'iis and structaral t rans i t i 'jaa , while erasiaq saru'i-taies frcm (h) and 
;ij; (K) rerea t irj'j (i) and ( j ) anti:., after an aj:p-i oTjr i at e nunir^er a-f 
r ep'et It i c-ns, a i es -^G. eat ed it'Vv ^ ^inr - ■:-a ■: aie is r>a:iGhea, aula lag in an 
unrefined gi'th-il ndnimum struct are; (1) retinina this struct are to 
id':-ntify at least tne luteal iTiininium or. ^. he E'E rn/p-ei sar f .jse . Als-:- new are 
;ij 1 ig. unas ;I; th3t Dind tne TrEA .::ind i: reaeptt.ts a-tntaining at least 3 
eleinerits atG e E't bind tt> rinding J^re^s in t:.e r^rceptta-; (Z) liganas ;II) 
^:h.iit bind the aa:iTL]Ti-:'n neurt't rt-pn i n re'Ver.t'jr r>7Z:\l'I'E C'jnaaining at least 2 
eleraents tnat bma to G r-inaing are^s in this r^-t'ep't 'tr ; (?) designing (I) 

■ ind (II) hy :.tMriput at i':)n j I evolution. 

EGE - The nietntd is used tt identity tne b i 1 eg i c a 1 1 y active 
-:v:.nf oririat ions 'jt peptide C'jmiins of li^ianas, p3 r t i cu.l a rG y 

neurat rophins (KT), that hvin^l t-t 5. r-t:epG'tr. ance :5u:'h con f-D rni. it 1 ens have 
ijeeri 1 dent i f : e.:i, s-ei1L r:aG'-tjles :h:it -Mtner mhirit NT-Ciinding ol -^ct .^s 
."ig<jnists 'jf the native liaand c^r; ire :l--veEjped. 

AEVAMTA'.ZE - IZhe Z^BMG is m.tre eZfective tlaan Er; jwn Mcmte 'Zirio n.etnods 
ijetrius^'- (i) a r tins 1 3 1 i''jn ai'tiut tne B7/ ii- LinGtea (iij i:G.acion cG tne: 

basis set is at':G :le:i at nigh tenir-e r 3*: ur -i- E:n:i <: n 1 y i nt r-jia ::e :i E: t 1 aiw 
eiup^e ra t are . 

Er.EEEFG;EZ) ?r/Z7'E3G - BV is -rapDsen r ari :l-:>rTi 1 y or: t:e c e rTu i rG s t i rri 1 1 y within 

■ u firedetermined ran^ie, .:ind the numZ^es :G' r-ef)-E'* i " : -ons in [ ■:{) is at lo^sz 
^lEThe refinmj s:eF);(i:', is r:-jn- I in--a r :ni n i"i : s : tea asi.n^ 3 iTraiient 
des'::er.t, cjn-yJG^te gradient, ga eiS 1 -Mew*: jrG 3n ^ M':'v;t tci -R 3phs tn ) , M3r:5a3rdt, 
variable met ri'.: 01 Powell minind sa t i':>n . Tne m^tl et:u 1 3 r- system p3rticalarly 
ini^Euies at le'ist ta^e peptide :ijni3in aroi tne rjasi.s s-:'t in::l3.de3 

r ta-s i.'jTia 1 ari'-.iies rjetween 3ltih.3 -carnjn 3tj:ns cG" 'i-aE-h aniin;-) as id 3nti 
n-r-i-jhbt^urina C 3n:: U at':)ms, but k-^-pin:i S3::n angles :h3t :iessriri':! r^Ianar 
Odrts of the strucrure ctmstarit . Eg.niiti jns :les siG.cG n :i the 



tenioe rat are- Terpen :i-^rit t ^ 3ns i tier: function 



iina cnt 



3pr)er and 1-twer t en^per a t a res ar-i UjJ'J ;:ni 120 PE To identify a hG.niiing 
i3:in3in, (nearly) optimised sti-u::tares 3f domains in a large number -of 
li:Tand anslc'gues (at least -jne being able to rjin:i the recep>tor) are 
identified, as Iscal PE minima, using the new Z'BMG methad, then these 



i 



4 :^/4ul 



structure.-; C'")nip>are':i tC' detect sti'uctura] f*^a^.ures >::';:rir:;ori to 
ligand-. .'pe 'iiicaLly thiC rr.eth'7'd is app] ud 1 ) ii'ri've 'jrcv 

f a ct or / Tr kA; {t') brain-^ier i ved neur 'jt r : pn i c iact-'^ir, NT3 ^^r 4 
NT?/'Tr-:': ..rid (d) trie re-ept.-r p7SNTF.. r F.EE'F:P.RE:L' MGAND - (!■ 
e l-:-m';-nf. s witfi p-rc'f'er sp'atia.l 'Ciiparicy , r*;.at. i\'e atomic Z'osi' 
zyve a:.'!; 'har-^e 1: c i' del. iriino .a -din.er -:C 'jr...i 1 c-n 1 L'jurc: t i':c. 
the :■ i.d ::'.iirr 1 areas. Tit/. i:.cl..>M;s a J e^:':- i ne- l : c!"i iiLOtii! (LFI-^ 



■'^3-117, v;:th 5 hindinc .ared.-. : 



Foe] 



^r-phoioi c i.r.t e : 
;t ^-ract i on ) ; ( i 

z^C' i-reiy T rp; Ar^ 

r.a r7^:drR civ-E:: 
3iTi, ii: Per t r ae- 7 7 , 
with the CHAi-d- 
ar:<:i *r.I ect i C'St at I.''. 



rdrr-illtA, Pn--'ll:tA aric ThrlMtA ( hvdrort-tic 
.ur^L: Hie 1 A tA (i-r.i:); ( :.) ) Ly^l ■o:.A (i.;rd.:A „ 
multiple par-iIi-^1 be^a -ctr--:rj::i * yp- hy'-rc-v^ 

A-;;;:') , mi;, v; r.n i LI y a t co r^.^i- o : r ..r.c E, -rA.i': 
jiiirilar" .ar.alycic c t birciri-:! a.r'-:u.: . r. T r k <. 
'rr'ie ^p^--:' : i :: t i ^ r: in'.duoe^ <u r.f ^c*' r r r<:'r;j 

Cl^ that 1 1 c 5 1 ^iR..: L >:! Lo L C'Lt i'ji. , v,jr. (;er W^ol;: 
.-c.d r.ydT'.'oer O'^rciri-g ^r--- a •::<■: c ^: r. to.i ■: :-:}A 
E'/.Al'liLL - /-. :?-£-t -ji K- proteins (: a^'ti'. 
'T^rKA tcn.img: w-c ^n.iiiy-ed by t - r::.M.t.i 

tne teririini . The al'j^'ri'Ain :;T^.:'Wo chat th^:- II (l-Il) and C (IK 
'■-■rndrii -yh-! -■dirr-r.: : '1-1 ::J htiF fc.rir a ccii;:::eM :d:ar ir..::d-... 

r-';U':'ri (■-'-11 an-.i 1 1 h ' - 1 ' ) arM tlerdrb:- (1-8), wit a 

ct abi 1 : ir.a racfC'i h'eir.'.;- i^'id';- a v. - P: 1 i ar.d Ar'::. 

ht^r^a rat 1 'M. ■: r th^r twr.- i-:q\oj.s- i/ :.;a,:^;.a i^y r *rt'a L.r i oii h trt w^iM:-; 
;lia:\ Ai't^ ai:d Arqll-"'. ThK- ■> ^ i-e r i ■:-a 1 rr a oceir,er ic reC'O: 
I e':-ei: t iji . I ria-'rti ve proteins 'i^'-y i- tf rc':T^:'r oe^cref; ry of the r: 
1 eai 'tr. . 
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AB l'?9e-Ie:,r,_ 188'dA::i7; bR A.'88-14!A-^ 

IbM G06F017-50 

Ibb C07K001-00; C07K005-08; C\ . Tbj v:,- ; .] 
EP o 1^ 7 4 4 A I.IPAR: I'-'AAA^ 1 6 

AA:.Tit: i f yin ;i drug candid-nte ' R\A jripr : s-s 
r-'presen"..:!': 1 'tn the- 3 -':iim-ns o iria 1 s t r u 
tr.e surfa::e of a kd j 1 'jg i -I'a i ma^-r j:to>] e ::a 1. e 
rri'i'del (AM) of tne functional srru::ture f Bb vdiL.:n may Dk- as- 
t i :lent:. ty fav^jurable :ind an t v ju r E:b i.rAe ro; ::t ions c-etvyeen 
rTLjLei'uIe; (3; i ..ier.t i t y i n :j a m:'ie::ulir trhomer.': (MR) capard-:- 
;-/itrdn Bb an-:l chriatd-r^ c.'*/ ^'^^ trying .A AL-a^:^ 1 a .:os t i t ue nt o. 
Mr' :s eitner sapable :>! ceir.^ oyntnesLsei fr:on reagent ^-i-mfoi'. 

c- T_ o 1. e io; substituted f':irm t :: ^ntr:idu::o iAl i-n iiyritnesis 
c.^r-esent iri ar: P: '; cari^bie af suust l t at ton v;Ltr. b3 oy re-astitn 
F't's; (4) :;er.erat ir.g Jists .if iCcessioLe PC's, su^n that rtm: 
"^tmpounds fr'^m eacr. List may be roiCtei to prt)duce a DC comp'C 
.nmtaining SG's, tt generate a first virtual library :VL) of 
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esf eciaiiy the theoretics! lis*: of compc>un is pr:>iucibie r^y resotion of the 
n,ert::--rs ot tne lis:s, e3::h merr'ber of ea :h .i.st ;• ont ai nin :; a o i-rnponent 
:--^'n:rrm to the other memoers I't tnat lis: aria a ■-■:)rnpor.ent unique within 
lUal list; (5) lirnitinq the nu::±.^'r ■'■f members ot each list using a first 
s-t -jf exclusicn rcles to generate restr.ctea sectnti VL of 'jC's, v/here 
tpetjti'Or. of the rjles invjLveo i nv Iving ii^r e i rtember cf :icn list 
:-..rr;p.i:ri£o:is for (ur.) favour ^ole i r.te act ions between CM an-:: a strurtuire 
■:-':'mpr is i n'-j MF and an S-'^ derivito^ fi' m the uni-qj-.^ corr.pt'nen:: within the list 
or that me:Rfcer, MF anj TM oe — j held in fi:ori spatial relationship to each 
ttner fir the o-jn.pa r i son ; (G; 'rvalo-jtlnj an j ranr.i::j hy O'tmputer the 
ir-'-rr^t'Ors ot the sect^nd VL for f :ur ::ble /unf av-:)Ui: able interactions with CM 
..nJ 'd-ias jeneratin^ iestri:t---i t h : r :i VL )f JT's rin::ed aS n.r/mg 
ro/csracde inter act L :-ns ; (7; 'tt't i ■: n ul 1 y se :.e tt ir.'-: ;;it ...e^s^ i turthei' MF 
fi:)m the mird VL and repeatm-j ster.s (4)-;6) t :■ generate :Ki --ilternati ve 
tr.trd VL; (8) scieeniii.^ the tnMJ VL i.-:.n^ a se^jnd S(;t r {' ex t.iu:? i ar. rales 
tt^ generate a restricted f^urtn VL f -omp junds whi.::n ar^:; tan iidat-js for 
syritnesis and exper irrental evaj.aatit-n tirag e:rica::y; {'-<; synthesismg 
some or all of the ^tmpounds ot the fo.irth VL t-.;. prt:duce a [)C compound 
library; (lOj exper rrt^ental ly evaluating the DZ c<.inpt;ands for Irug 
etfi'tacy; (11} analysing the experimental efficacy data for 
structure-activity re lat lonshif > m f -triQat i jn ; (I.:; using the ■jDtain-'d 
mfcimat^on to se:e-:t revise-.l set of lis^s of ac::essii:):e hj'Z's, exranded 
t- include, and optionally res*: r tct-r-d o excLudi', selectea h': ' s pr-rsent in 
the Lists generat:e'"i iri s*:*;5p [b:; (If) ref:)ea*: m-q steps {6} arti [1 ] ^t 
I'.^i^ntiiy turther can^ji.iate ::otip- Hin-i:; f-.^i' synthesis and ext: t;;r ii;enta 1 
effi'-:a'::y evaluation; (14; syntnesising ana evaluating the further 
j.cipounds; (15) if required rep-atmg ster-s ;il;-{14; at loiiSt 1 time, and 
Mn) L':ient 1 f ymg as lead candiciite a ctmo'iurri syntnesised and 
ex|.)er rmentaliy evaluated as ab :-'.'e , Ais-t tlaimeti -ire : { A ) n :>vr-l active 
cimptiun-is identifiei by the ab.-.'e >:ess, a:ii d-d a method f<..r 
::ijnu f actur irij a ;iru7 c 'lapr i. s ■ n^ : -■;rers [''..]- lo as dto-'e ro.lLUo/ed by: (a) 
m.-ariu f act ur iiij the Dmp-i^urid ide-otiff-rd m s: ep, and (id S'ptijn/ully :::(::iaixving 
the prC'duct w^th at least: li::ari ler t r e:-:--_tLpi'-;ir: . 

A['VAI]TA':3S - A relatively small set of readily synt hes isal.de DCs is 
rapidly generated, witn a high success rate m terms of ::lrug efficacy and 
hence a hi^ijri predictive value f ■.: r dire':ting subsequent itinerations. The 
synthesis is simplified and m.mimised and the success rate is maximised. 
The identified DCs are implicitly readilv synthesised. 
riwa.0/3 
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(Pi)M) -^r CI'-?j:'M, r.avir.q r-;"::-: r ded <: 
2t ":pr.y 1 J :-!''Cca.r aa:t-u.- genome i"iiV..r 
■:j i "6-:". la i 5pe c i f i "av i <: ri , a fr\:: 

1-;.-: -2: identic. --a tr.ece seir..^; 

genome ■: t ■r'jiu:!,- r i;- 1 a a inf^c r'^dra^e I'C 



d naaaleca. id^:; ae';{uera 



e-: a.O':r;e, a fraairiena or a 

a/^c l.t.:at ide £ eq..;;ea-:a;e ol trie 
(qaenaa-a; arai (ii.aj rearieval 



me iina ■:■ :rTL[-'r i a i aa * i.e 

iTi'-.ir..: : : r :-':-ir,r /i r : ag taiqa-t :-e'aae: 
'■Arz\a at:-.r<.i>r ir.e^na ider.aify ai.-ia' 
me.nna i:-r ah. t a ird :ic * he a^'inad fjqc i:;: ae-aaencea. 

^■2E - H':ar.ad c-'.p/ aec- rcnea aair.q ^: ae raE:w 2 . auretaa riJA S'E;queaa:ea allows 
pijt.a^iv-E- functions ^ >' he asai^_:^r:ed so that, protein 

-OTiO'jdi r.'^ or r^qu hat. y reqiaTis (af Liijiumercia 1 , t h^iiraC'eut i c or indaat rlal 

iirxo r t ,1-::-^ .-ar. ): e a^i. *: a.: neo . 2r>- :e f i ..-.^ 1 i y , saqoeno^^a ahaoh ar'o iiaeiy 2o 

'-■ro-'O'ii- .iri^- iqt:']ta hi.jce Ijfve:: ii::erc i r 1 eai and th^Eaa- polypeptides con 

h-E t-C-'O f; a vic.-vr.'. i/a ii.jioa i i a aca.ia.Et S . n'^.r ea..-: i a t e<""r i ^f'. ( hi i ined } . 

T/:-- polypeptides -.^rE Ae 'E oa- a:Eed la a td.airr.eo ki': fcr ohe 

iita'/anC'Liea ra la n h.aureua iri a aanijEile. S.au.ceus impdioated in 

nar:.'E-i L-oi haiLar. a._ aa a .-.a-a , in:.lua:iitq '::ad lul i t .i a , e-/^.ic i nfa-ct ia^na , hood 

}Ecq a<: n 1 nt;, a a t ^e laya 1 1 1 a , akin a]-xJi at^Kjical wa^Eind inreoaiona, scalded skin 

.-yn:-r::ir.a; t:::-:!.-^ aaaq-^ ayatirara^E, -to. T:a:- t r ana f o.a::.ea o .la;^ c:. n i -a:, o c.^n he asec 

[':r I a ■• EHii dri.int ■ tM a-a; j<Eq- i 'ETl a r tn^- polypeptides. Trie nev; UNA 

-■'-■[■ .':r: [-v.o 'Ae- ■ r that qina-nt .E ; .:r:-E: t^aadul ^a ie^i i.iut^i ^. or iE^r ■E'iE)'...'a: .Ajo 

i a'-d. a' : O';: h^: no i e i.^Ea.' s oz ara/ ad tn^E 2i?l S-at-'-^-.a Ee^yiencea o-iaiia^^d) . 
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d-^t7 a: V;(j Iv^^t-dPlGltP 1 -'^5 121 3; Ad 4 46 3 4 A Ad l?-:ih-44e84 

la; VP ta40Pt7 A t'G : ? 4 - 1 P5 - 4 4 1394 1014 
:.4 4eP4 A Based or. WO 9hlPdP7 
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. Jnl . P>;:dd do 4 Pt3:-7 1; tip 4a -^aPPP; UP PE^le^G 
:GM G06F017-10; G06F017-11 
:GE G06F017-50; G06F019-00 
W^:' -G1.7 -H:.7 A :.1PAE: l?^^-o~ : 1 

Pneo.i'i-t in g the ten'jen::y a protein to Pc>nir-. eitner: (a) ar. 
aiTiprdp-ni 11 r a 1 pn a-;^ : ruot nr - ai ipp an aniph^ r-'h i.l i: ;.>e: a- at i-^(_;tai - 
oompr i se;- : fi) ji 1 ■ -al at in<^ a st-rie.^ ot. vala-s far Pa) d-ali:4aa-x using 
-^'juari.jn (1) or [b .■ tl-P'e*:a-x Ooirig equaaion [2], tor a serlfi^s f t:-jrtions 
:d r n-j protein, ea -p p':-:ti-;>:a h.iivin.; .a span ;■: residues, v;he:re 
the series ■:)!: p'lrt-oris ^.pario. the protein aii':i x is -in Lriteqer: 
Eqaati;>n ']} i d-.:lT.)na-x -- Hx f ni..-a Itiha-x - [r'a] .:ind E^gnatis-n (2) is 
p-i:)etJ:-x - Hx + niu-b-La-x - [^y. ] • In eqaatijns (1) arp:i (2), Hx is tne 
avera:ie nydr-jph-abi'id ^ y f:r ii span :>f x res^'iues as . nq tne rP/te - [''jcd i 1 1 le 
S'Siile, nu-a U.ina-x is th^^ ny^i r^ip-n ao la ni'rnen* (span x) f-ar a 1 pha -s^ar a: ores , 
the angle n-atvyeen -.t.-^ r-'Sidae -invi tne sa'SC-ssive r-sidae oeing IXieg., 
mu-l:>'ata-x is t l-.e h'/dr aoh :'P'i :: no'iaent (sr-an x; for' P)--'t a- s t ra snares , tne 



'niijio t)etv;eer. ■sr.e res i iae ami tn-a sa 
[cja' i.s the r:):ioiti_'n iera-:. nde r.t ti'arn 
vala-!'S t'')r n-ajpria->: ar '.'-De: a-x ^ra_ 
■■;nere (a) r>eal^s v;i ie enj-^;ih :.o sarres 

to oe a Ion a- a r .sat are s ani [o] cea-is 



cessiv'".^ r--^ i iae being IcJieg,, and 
r span. :^:ty, ar. 1 ' a id aepia:tinc: sne 
hiaally a^^ far::, a seri.es at peaks, 
pan::i tc- a segnent of the arTiina acid 
rane -jo a:i a Lf ^na-ne 1 ix are preiicced 
tnaa are t co narrow ta C'Srr-esDond to 



a segment of the a:ninj aoid segaenoe Pong enouah ts) span tne membrane as 
an alpha -he tlx out vdiich are 'wide ensu^n t 
amino acid sequence with a length between 



^, ^ 



a segment ot 



and 14 a::i:-na acid residues are 
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Analysis and mo di f i cat io^n of protein stru^-tuie - by 

.'l -jent 1 : 1 cat 1 on .:f ravourecJ -ind suffressed p^atterns -jf hy':;rcp hcbdc amine 
a'Vid residues. 
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Vv>") 9436783 A i.ipAB: 19971109 

(A) Identification of favoured .-..nd suppressed patterns of hydrophobic 
residues from the group Leu, lie, Val, Phe and Met indicated F and other 
residues indicated comprises: (y) providing aminj acid (AA) sequences 
fi:om natourally occuiiiny proteins; {bj ap-pdymg tem^Dlates ol 3 
t-:' 9 positi'i'ns comf.vcsed of all combinations of. F an^il S in each position of 
the ammo acid sequences of (a); {■::] scoring the frequency v;ith v;hich each 
template is found; (d) determining the standard deviation (3D) by which a 
template occurs from the expected frequency :::>r an -moirical distribution 
C'f F and S p'jsitions based upon the frequencies of the AAs j.n the 
naturally occurring proteins of (a); and (e) ran.^:ing the 
patterns on a scale of preferred and suppressed on the base ■ 
t'Otween observed and expected frequencies. Also ::iaimed are: 
Eilteration of the structure of a predetermined .V. sjps.lymer, 



JS 



d: those 
(B) 

comprising : 
(b) 

prjviding tne i-aentity of favoured ind suppressed |:'atterns "if hydrophobic 
aa residues as determined by (a)-(e) as in (A); (c) lientLfying patterns 
of hydrophobicity in the provided PJ\ copolym'--r ar. the basis 'if {rO- ; 



a J providing the sequence of PJ\ residues in an kA cop'Olymer 



residues to patterns whi'::h 
jssed; jnd (e) -iltering the 
:t a structural ;^hange; ((3) 
he poc I jf ihydr'jphobic s^de 
'drop hc-r-i ^ si de 

; a. ' pr jv^ :iing X-ray 



cc'mparin? the concc-rdance of ^:h- ooserved F-A 
have been determine'^ tc- t^e faV'Mre'i or suppr^ 
i-ientity of the observed aa resiiies to effe<: 
icentit ic^at i'l'n of the extent and density cr \ 
■idrains in the c-: re of a protein int-j which n'^ 
':d!ains of an alpha-helix are imiiiersed, ^ jrr rjr i.: Ini: 
CL-ystallogr ap'hic coordinates of naturally sccurring proteins; 

ir.) appdying the strip-of -helix template {X X--X X--X , joined in a 

circle) to each of the alph^t -h*-.-lices m the proteins c^f (a^ to 
maximise the mean hydr^ip^hot i ci t y of residues ^n X pssitic^os; (:) extending 
tne ternpdate patoern aefinea by the application m step- (b) to adnacent 
non-helical regions; (d) determining the t em.jdate-speci f led positions from, 
the gp. cf (i) <ail X positi:ns n tne helix, :iii tiie N-terr:dnal X 
position, (iii) the :-terminal i: position, (iv) the X posit^s^ns in the 
helix between the N-terminal X [osition and the C-terminal i: position, (v) 
the X pcsitic^n with the smallest residue, (vi) the X ocsition with the 



least nydrophobic residue, (vii) tne rirst X position beyona the helix at 
th'"i N-teriTunu;?, (v:ii) tr.e first ;•: ptsitio^i neycnd :he helix at the 
C- r ermi:"iUS , (ix) all nori-X p:>SLti--m3 i.n tr.e he^:. x ar.i \x] tae X posit ion 
c-jMtg. one of tw:. sequential resi lues in pr:iiriary sequence fron the gp . 

Xea, He, V,ul, Phe sno Met; ie; ivter:uininj the L::iestLty -.f all A/.s as a 
function tne ai^tan:-^ their C- ulfhj rrc-ri the ■ii. s'oil 
n-:.n-nydr':-'7eri at-orr. of tne side .:hain of PA.-: m ^:he t-rmpl it-- define:! 
ptsitions ar.d (t) vlet er:n i ninq the mean density, osrn ] I at i v--; hydrcpr.tbicity 
and shell i-iydroph ji; i :i ty ::f a :h residue il--nlttie:l in .-.■■p ;e) around 
ei/n teir.piate-defir.ei p-:.: ititn; ( !■) alteriiri the .3 1 :. uot.iiro.; d1: an i Ipha 
-h-Lix an-i/or altering the straoture tf t.n- apf o, s i t 1 Di'ia !. .ty Ire ph-thi c pool 
t tnat n-hix, . . pr e^ie t er ][;ine i AA : ::>]:j: ■/]- e: , oi;: : r 1 s :oi : {i,) pr:,.7id..ng a 
.o='{jen:e AAs frxn a f r e iet errtine :i A;^ :: .-rx.: lyn-r r ; .0 .■■■t--p dt ) - ( f ; as an 
(■A; (oi oomp-arma tne o:: nco r ::i..ino- ■ :> f ^he ■)l::-sf-r'.-ed 7iA i-osidues in the 
ptsiti jns L't the ."o: r ^}:,-^.f -he 1 ix t-saijl e int.' j ;j 3 ; L t i.-jn,- ina in the 
aprj::)si tiO'nal po:l :: f resiiaes tO' fatteriis v/hiol- Irr/- ne^-n determinea to 
inO'St favo-ijr or res': sup];'re;:-s neli:-: ex t -ns o-.'ii ; arid ;di h.i. t- r ltig the 
identity of the onserveo AA resiiues tt- effect strjstural change; (E) 
preii-ting open rea^-img frames ( Jl-Ts ) in l!1A segaen.;es, ■:: :-i:ipri s ing : (a) 
pr>tviding a DUA seguence tt- be analysed f'::.r the presence f 'JPE^s ; ( b ) 
rietermining the DMA cttd'tn puattern c^: rr esp^-ndin'^r to JFPSt' .-.nd SFSFS, the 
respective LIJA ■~:o-i^ s h^emg -nor.T-- r--T--:i-. riT--r-: nT- inc 

-n- ov:--T--n:jr:7--T--nonT-, vmere T is n d-- :-:y r i re nu :: : 1 with tne base 
thyiame md non": : a d-cxyr lo-jnu tleot icie witr. i: b ;is-:^ s^lv/cts-d frtr:i 
aiTn:.ne, cytOis:^iie, .^ianme an-l tiiy.ri-ne; i , ss.a...aij '.he e -ca rr ence of each 
pittern withm rJi^ 'eJA se'^uenc-; (d; seie':;ting i:'Olyn 1 et) t ide segments 
v;nich are inound-ri h y LUA nucleet:^de patterns r res::> jn :ii t :■ this SFFS3 
aa pattern and vvnirn hold '.he ;lFFf3 pattern mtre t!r::n tne fd^ frt)m "^he 
emf'irical frequency -^f that p:'atte:: n in a ~^an:t;;rd s---t :: {•\]A segaences and 
wnich exc^aie tne fFJFd pattern :a(.aoe t::a:i one fr.oi the emt'irLcal 
fr-quenc:y :.d; t rret [.■att'-trn in ,ii standarc tf "AlA ■ ■ - [...-n' -es ; and e) 

id';nt L f yirrj s-gmer.t3 ct)ntg. tne ^ndiviaua^ c^r ni-.^rge«-: p-js i^i'^e selections 
as the preiicteo. oP'Fs; (F) predicting ef .::lj.'h3 -nel.^;'-3, coimorising : (a) 
providing an AA sec:uen'.:e to be analysed i-jr tY.e oreser^ce <tf a helix; ;b) 
sct-ring the segments m vjhich "he oFFSS tiattern is fjunel; (c; extending 
the scored segment t'jw.iird the h'- terminus ana/er the terminus ro include 
a^iditit>nal residues which fall withm favoured comtjrsite f^atterns vjhicn 
are identified with this invention to include the hi'.ihiy fav.Dured pattern; 
id) identifying mdivi^iual or overlapping segments .1 'It-nt 1 f j ed in tiiis 
manner as the predicte'i alpha -helices. 

The methods can be used for the analysis and mc-d.i ficatioTi of 
protein structure. They can t^e used to design polypeptides 

de novt> 'tr tc- modify natural prods, to provide the KMst effi^:iently fc-lded 
and stable forms ef: AA copolymers. The tecrmique Ciin he applied to drug 
design . 
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ICA ■;^.:lt]'933-9 

AH e; 9'-^ 1 L ^. A v?/-i- : 1 - : : 7: 

J'- ;^ I ooer.t ^nt i r> :■ :iy :2±>, cr r r^g-'-rt :. y i i.:" f a :i C'cmprib^ea: 
■leteiian. :■ t tr.e :: :-n r-: rn t j r.;.: rris^ct- :re :■ r tl:-;- \'^iri :.bi- le^ic^ri of the 
r. ■: 'i^\::.r_ AL c-r 1 1 <.3 :::in,.er. ^ by : ■: t. : u :;t. i : i ; a m: trie r :jd-:rit At' VE:riable 

r'-^r ^t) -j-rne r n ^ i ri ;j .y --:ii>:-n'.~v alignments fr::r^ rel::ti.--e 

revolt: i .1. it y a., s 1 1 _ Liut .i i i .-u-. y^tnv cr y;::t a _ ^ -:.>]:: i.c ;e t ri,. ::t u res :t the 

r::-:ier!t Ab variahle regior. neavy ar.G light ohains :ii.ve a ^iet of he^ivy 
ani i:i;jht chain frii-iewcr>: posiM-Ln^; ;r) "Mjfinir.g a ;iet f heavy an.j light 
chain surface e:<p':'Se<i anirja ac ..-li resiojes asir-,: th- s^-r r.f f ramewc r .< 
r^asiticns generates m (t.) ; ( 9) i^ien:: i f yin 7 frc'ii; nunar. aritil:c:iy aniirc acid 
sequ-rnces a set --f r.-avy an i 1 ig:.t. ch^^in ;;iar:a--:e e>:A".-.;o am-a:- acia 
i---::idijes -h^)t is n.^'S^ c:i':-^ely . o-nt l : : . i t-. tiv;^ .;;-et t r-ecidcec defined in 
'■-J ; (e) :-ur-'C":it:ij- i ng c.^r .f hea'^y ar<-: li':irc:: ch-;ir. surfa:e exp::::e:i 

amr..:- ,::;::ic;:; r:eaid.jec deLined i:: { ) u^'di the se" <: i: ::<:-;: iduea i:;entifie9 Ln 
hJ) ; (1) constructing a inod-I ..d ij-.e ro^ien^: antir::-:;y van ijcde region 
resulting frc-ir; the subst i ti:r n i (g. :^dentiTying anv airino acid 

r^siaues f.rrju th*- cet i ci-nt i_ f i - a in ■ :b , *:hat i.:ie v,:*:hin 9 Anastrcrr t cany 
a^-:rr. cd' any rfi'SCi^ue c f the cC'ire UrM^eritarct ".■ det^^rrnn. rccicns (cPRs) cf the 
^' ^-^^-^ 9>:: be hcir.anisea; and (h; ch^iigmg aiiy rescue- laen^ir^eo .li"; (g) 
tree. hai;.aji t.::: ^:hf: rui.g:_nal rr-)en^ ani?:*;. ar:;a recr^e. ;d;ep ;a) n!:?ed 

n-t: h.e c-Miiucted fir^t, but muc*: t^e canducced p'rica: ta:- step (g;. 

USE - The humanised antib:adies permit the prodr.. of 
functional aitered At'S, v;hich retain the L^mdmg f^ararreters of the 
original non-human antit-c^iy, witti impiioved trierapeutic erfic^ency m 
piitients due to the presentat i-an '-.-f a human suriaoe an ttie variable 
regicTi . 
Dwg. 0/10 
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A method for producing a humanized rodent antitiody ar fragment thereof by 

resurfacrng, said method consisting essentially of: 

(a) generating position alignments from relati'/e 
accessibility di s t r rout i ons fro::i :-:-ray ■:,rystallograp[iic struv:tures of a 
po^d of rc'dent and human antib<ady heavy and li-:(ht chain variable regions 
to grve a set of heavy ar.d lignt chbin variable re:r..:.n framev/ork surface 
expensed p'Csitions v;nerein the alignment pcsitions f-- r all. rc^ient 

and human variar^e regians ure at lef^s: ab.jut: ■=^9?, i i^-ntd ca ]. ; 

(b) -iefining t-ar a ro.-Jent ^ntibo-iy di fragment rhLere:>f set of heavy 
and li'iiht chai.n variacde regi:jr; framew.ark surface expase-i ammo aci'd 
residues using said set ■:'f :\e.=i\--/ an^i light chain v^riatde reuicn framev;cirk 
surface expased positions generated in sai-il atepi (a^; 

{■::) i :lent i f y ir. :^ from humar: antitoiy ammo acid sequences a set of 
h-...vy ani light 'jn-iiin vat. iatle regicn f"ra:rev;ork sui:9.::e exposed amino acid 
r-i'Sidues :nat: is mcst closely identical to said set :-f rodent surface 
exposed .v.uiriD acid residues de:med in said step (k ; 

(d) suL'St itut ing, in :he "r.rmhle region framew:rk amini acid 
sequence of said r-odent antitc-Jy -r f i ^gment tr.erec>f, said set af neavy 
and light :hain variable regian framev;irk surface expased amine acid 
residues defined in said seer- la) witn said se:. -af neavy and light cnain 
variable regian framewcrk surf::ce expcsed amLino a-id residues idenaifiea 
in said step ( c 1 ; 

(e) canstructing three -di::-.ensiona 1 mi dels of said variable regicT: cf 
said ra^dent antibody or fragment thereof and of said variahde regicn if 



Sclid rodent aritioody ur lidijiiLenL Liieiec'l requiring rrom tne .'--uDs 1 1 1 ut ing 
s.p-^cified in said . tep- (dl; 

c T'Tiipa n:; ^ vaid r t-;ree-ai:T:^rn3ior".al rnC'd-l.s '.:on s t r uc t ea ir, :^:iia 
(e), and i':ient 1 fy in^:;] any amin-j acid r*r^-idues tiom said sets identified in 
said stecs (tO -.'i^ ' ) / tnat ar'e wi^hm 5 hv\''.:strom3 ef any st'i'iu of any 
r'esLdae C'f the --cirriT d enientar i t y deternininq :^'"^ion-:: said rr;.i<rrd antibody 
er fra-jment there-ji; 

(g) changinq /uny r-isidues identified in said step (t) from the human 
^ tne "■ri:jinai r:-^\--r'r -irain- i-i resicu^^ " tfiereby aef.ine ha^ianizing 
iz'T-: of SArtace -xr:':sh-d .-iiriin^ ..;sid r*.i;; i ciu-s ; 

in) repdacmg the set r^jdent ant il'C'Ciy \^5i^ial". Le rei^ior. f ram.ev;«:?rk 

hur:iani:::ng set ■ f "a:r i arvl :^'gi■Sl t" samew^.^s < >^arfa':-e e;'g:-:s^eci air.in<j ^^ria 
r^-siLiaes detine<.; in sai'ii st-.p ('.-'^ '-"la 

h.:iving ::in':iLna sfDe- d : i . t y . 
r.wg.O.IO 

L123 AMoWEr t OF 7? Wid:-; Cn?':}^.:G}i': l-'i'A sFr.WEltT T rh-^(j?.MAT I (tM LTD 
AN r?94-n:'3r^C [lOi WldX 
CP 1^ ':0-'il81 [02] 

r^KN Ki ?'/4-fa:,4 -H'.T Ti:!: di :^.;''-;-0 B-vd. 6ij 

TI r rech^r .ut ^ eji rt ■ inp i ^'Ve'ii haid. n; . s* -d <;int li: om. e.:- - ;:.'>' i: tsa r i sci"; cf consensus 
■ir:a iri.p'Crt i:-(:rivpd eri* -nt a rc ty '"i'"-"^ laair.i iig r'^a.i<:ris ar.d 1 r ai^^-wc-r k regi(*>n 

i'C Bd-4 I'Jl t TOl 

IN CARTEP, ? T; Pr.P:?TA, L '"^ 

FA (oETi-l) eEIIEfiiEdH 

CYC ::.'-'> 



' 1 



14, '::IP ot Wt* 

Id -1.; CT A CIT' ot [J3 



T...i,7 . /\ 'T L.;-- (''LI rjir c • r-::- . ::. f ^ rn - 't r ^r-' j-i-^ y^- 

v;: A'; ':a Jz 

At JrOiidAC M i;di'lO::i5 (l'':*4..": 1 21 iO i t- 1 

■j:2 ^821327 A la?;:dO:3 ( 1 '.^ -1 cd -22 7 1':2 1 ^:.-0 1 <- 

do ^^0242 27 A 2\:2[V:i4^2 t2-d;22 7; Aol ?2;-2'2-2 22 

ALT WC 24a4 6^'2 Al W'.' 1 '^^ 22-tiS7 t. 2/, i2-*-'0s20; Ad 2:d.02 2 1 A Ad 1 2 22-: 06 31 
1 -j^j^ :.0r ..2; II::. 22.'1/::/ A '"IP t d:; 1 'r^'^I -7 i ^ Vy ]^21i 
1 ■^■r^y t: 2 1 2t:- 122c0tit, i]^. 12a.: -2 ^4 2^: 12 222!:.:.!; d.:. 
1221-712s7s 19210C14, CIP lT VJO 1 ■.;^22-m;: 1 1 2 2 ]222■:t^l!:, (:--jnt cd UC 
] 9 22- 2 34 27 3 1 9 920221, dS 1992-4 '.'' I 042 1 99 2 1 2') 9 

FET Ad 9:ca;K31 h Based on VJ2' 94t:4*:^7a; d3 ^d::;:42;^7 A Z'lrr. c-f dC 2221327 

PPAI dd :9';^2-934 ?73 l;'92':i-yl; dd r;^'"^ 1 - ^ 1 2 ^. 7 2 1 ^^'l^ 1 2 ^id ; WO* 19c.;.-U32126 
19 9^:')0:5; '22' 1 992-4 ^7042 b.''9ja^'td' 

PEP ::.Jni.Ret; ViC) a027 201 ; V;i2 ':^c2 2 02:: 

IC ICM AOIKOB ^^- 2 92 ; C07K016-00; Cl-MOlP-lo 

ICS C07K013-00; C07K016-30; C 1 2 II02 2- 1 J ; C 1 s POy 1 - a : ; 
G06F015-00 

AP v;ij ?^4ij4 079 A dPAn: 2tinO';)0'''2, 

A method ior makin; a humanised ant^tcHiy :A]::d juxios-d an amino acid 
se-'-iijence ':)f a ri-."'n-hum.^n , iiup^'irt Al:>, on<:i .u inaniar. Ad, C':an;:u" ise.-; tne step)S 
of: (aj ibtaining th- A/^ se-p:ence a*' ^east a p-^r^ior. >jf an impc I't 
v.Tir-i i::.le d^j.^imin ar\.i >i ■:>-.'ns*:'nsus human -/aria::de :i':'m<j:n; :hd L:.lent i f ying the 
■::Si:ap L^mer:ta ry d-t-rnu n ing r'^^ui. jn (CPPd seciuen.;'es in l^jzYi \'ariat'le 

s-;: i^uen ::es ; i :d sul:)S 1 1 1 '.it i r. :t :in i:'ipvrt oo:\ 5--quen':-e tjr the co r resc'onding 
human '2dP sequence; i :i } ald._pnin-.i tt.e Ad seguen::es ol a tramev;C'rk region 
(Erd ot tn- im[:c'rt Ab .Lin:i tti-- Ed :d' -h-:.: census Ab; ' ) icient i f y i n ;j 
.imp'jr" Ab EF: resi-iies Ln tne iii;i\ed EE sequen::e.i tn^t :;re non- 
homologous t :) the -:-jr lesp jnd . ng cjns-nsus Ao resiiu-^s; 'f) 
■:ieter:Tiining wh-ther "die non -homologous inp'Ort .^J\ 

resid.^e is " reas jnard y e:-:pe:;tei i nave :ui- ji the f...ilowin;^ effests; (1) 
M'jn-C'jvaieritiy oinus anti:ien dire^tiy; 12, inter^rts v;itn 2L!R; sr (3) 
part icipiates ir. the VL-t'd: interfa-::e; ar:d -AO for any non- 
homologous im.pirt At) P^. I'esLdue v;nic:- :.3 leasonaoiy expected to 
nave .-it least ■me 't Cih-se effe::ts, s ur.>st : tut ing that residue for the 
corresp jn iin g A.A residue i.n the cjnsensus Ab EE sequense. 

A humanised versi:;n jf murine monocl^jnal antibody MuMAb4d5 (ATCC CRT 



13mC3) pi'^vici^a v;hi^:h retains tr.tr Linuing ^fpeoiluity C't ^;rie pai'ent bur 

is r.^'D- inirriuri' 'OeTi 1 c: m ;iunians . T\--^ p<.ire:it Ab i :y dlso ':Mr:ec^.e<:i to the 
ext locel luia : domain oi p 1 8 3 ( iibl^..^; ) ,:^nd ha,-; p'jteritial lor :ro-itment -Mid 
imaging ''^f e.g. hitman i:)rea.= t aniJ c-varian c^^ncers. (The s ing 1 e- s t ep* 
humi risat . on i .r .--hovm i r. tne fig.) 

"nE/ADVANTAGE - Tne metnod prt>vides ^'ffi^:lerlt means ui hnmanising 
ant it-'j'i i'.-s , a? to retair. :.inprove th'.; affir.ity of ^r.'- non-hu:nan 

'Icniir an*, i h 'jiiy t-:r a a i ven a r. t i <:]■=■ n . Th- aritibcadle? are less aritig-rnic in 
::\]mans -rrian n'tn-n-^man an-" ihtfjai ei . 

i:^wg.;. 7 
FS t'Fi Err 

p/: :.}' ; (■;; 

MT t:-!: r--)4-CiJ-; E:4-t;jl; B04-N04B; ::d2-KC-1Al; iOa-nj^; nrA--hiiA2; 
I :.^-Hl 'Al; bCS-Hlc 

L123 AKSVCEI- 5 ' t;E 73 WPIX COV{r:iGi:T ..O^ll IjERWENT IMEUHMATION LTD 

AN : ;^ ^.^-''^-t; ^y ;^ 1 1^ ^ ] wpix 

DNN in ^-1 -i:5: 4 ^-7 PMC C 1 9 4 - 0 0 'J 4 .I' 

TI 3atab.u:-:e <::l ^o si f i cat i an method l:-ased o>n segu^iince '::'?mpar is c-ns - by dividing 
C'h'A L'T protein sequ^=-n ::t-'3 int'j struoszural wc-rsis taking ints- 
.::S' :cant ^. t a .• ui .a 1 .^nd functional ei a 1 1 on;E h ip s . 

Lp: E'.-4 TOl 

IN KlTTId, P 

FA TA .iEI -N) ;:*t-rd !7r.NE EM I'W 1 a'.!' ATNtt; > '/Er.Tr.I EE 3GE3 
C i* C 

P: EE ^7/':- Al 15-4 a: IE { : --i^ V: >; * :jE MJp G:-ECi:-4:i 

i : AT t E IjE Er. t-:: EC : I .-E 
A!JT' E:' : - / ^E, Al li'? i^^E^'-E'ej 

r;\Ai ET' 1 I i .• 

REP . '"nl .l-e-: 
IE ICl'l G06F015-401 

A:^i EE '. S^/-.. : A EPAB: i J r* 

Met hO'.: ooiuprises cleaving tn- 
Ln:vl..'-je analysatde infijrmati' 
relat lonshipiS . 

Ane ^ t i-U'i'tural vjords p.r'-fA ^rtt (a) naciec-tide SG-quen^i'es ( Nuc: I nd*jx ) , 
P'Tof. tabling into a':::-junf: the degenerative nature of zY.e genetic code, the 
T'lIA re "ojgn It I'l'n seqcen^:es fc^r tiin'aang proteins and ai iowing 
rrclsniatches (M) or (it! >irriin(ja :i d se^gaentes (A^A, Idenlndex and Chemlndex), 
t'ref. taEi::g int'^ a tcount their chemical and steri-t p>roperties ft)r 
gr';iupirig t::e::L. 

nu-::naex in oas-d o>n tne c: ns ideratitTis that li) 10-12 nuclec'tides 
t z-rm 'tne h-vlical ret-.rat , Ei i ) trie degenerative cs-de occurs mostly in the 
tnird, S'tnietimes in tne first, p.osition of [iNA trip'lets, (ill) that the 
i'lncir;.-] sites fc-r regu^at'try proteins usually are segments of 
t-E nucdet)t i^ies , i.-. lialt a helical rep-eat (i). Theretcre mismatshes are 
a^iowei 'tn tne ■:ither half c-f helitai rep-eats, (iv) the significance -tf the 
numtier ot rr.atothes (EigA) 'yEtp-enls 'tn the sequence 1-ngth (EigS); and 
Enendnviex is cased 'tn the ■t^-ns i 'lie r .i 1 1 ons that (a) proteins sre 
either n-lices (heLi':'al repeat tf o-E AA) or t^eta-sheet and turn (at least 
4 A/E , (lE S'.ime /vA It-a^i rt lY.o s :ini't structure ani ;iinilai' una racter i s t i cs . 
Ad-ii t: 1 'jn jl 1 frequently otsurdr. :^ strutturaJ v;^)rds are of lesser 
.-■t iTii f J c firice tnan wtrds whdsh 'jctur' Ie;.s ftequently. 

'f^E.'Al'.EtANTA^E - The niethiS):! is jse:i for iatao^se classification of DMA 
::n :i protein seguen::-ts. Ejitpaiei t-:- pri'jr ant met no is, trie new 
rneth'td is faster ani aliov/s better detection of sequence s imiilarit ies . 
['w.j. E/'i 
Ev/vp / '> 

E3 7' PI EEl 

EA AE 

H3 CPl: E>-i4-Eyl; Bll-CC^-i; B12-K04 

E123 AUEWEE 6c t^E 73 WPIX CEFA'RIGHT EOOl DERWENT IMEORMATION LTD 

AN IM ^3-7-.i& ;,03 [39] WPIX 

DNN N1E93-237572 ENC C 1 9 9 3- 1 3 (>7 2 0 



:{ u e n c e s i n t s t r u c t u i' a 1 wo r d s which 
:C'i.:t structural anti functional 



ziioa - 0 / 4 'j 34 G 1 



TI j-sne inf -i-mat ion teotin^g :ievi::e to evaluate similarity b-rtween amiroacid 
saqijerice an:i r-;: f ei'e Mce hiri i ri- )a :: L :i - jriipr i s s ur.i:: ^: let t: r/j:noer :;t 

L'~in';^e5t :-:.)]TL:n'"")n let,t-;'r betv^s-:n i ne se ^jaenc:^;: 3 an:: ^ di':.::ul at i f:)n '^riit to fma 
r.it i.'j ')t l^n-faot c:':)riirTi:)n letter ietf^ci.ed . 
B^:4 01 ^a.l oa>; SOt TOl 

PA (r'.iiT; F'i:r:T.;M lt[) 

'tVC 1 

F*: JF' 'V.ii9':;-: A i;*o3o--^l ::.9o339)"- 1 CI.'lMOOl-O:) 

A:;T ;F' Ot. laa;;;: a JP loa^-OlOI^ 19iiJi(;n 

f';\AI Tr' 1 101. ]. :t ! 0 1 ' '6 

:■":■'] ''lUMoni-'io 

1 t; G06F015-40; G06F015-42 

Ar jT' ■: A:A.--^ A; A UPAB: 1 .oA. I :..AS 

■j'.oi':; :^lf^:r■I•L:i•: ; jn t'j;itin.i ::-j:u] "i" i o^^s dete:t.O!n uniA. (10) tt iete::t 

*" Tie rioiar)--!- ■■•t ^ hio i ■'>ri s r ':- >ni:n--iM l-'"*:o~s la-: v;ee:'i 'rn-:: ariin') ^iCid ?e::iOzm:^e to 
b--- t^-;T.- 1 .lu-i rV-e r-'^: re r-^n conio-: :: Aol :i : n a -^a :a pxpr-=:sseri r)y iet:ers, 
an.:i i - i -a 1 :i 1 1 on unit HI) t. j ■.: 5iloal ate tne rati:. :A the nuaiha^r :A tne 
Ajnaeat. ■::':':fL:iL-ji-i letters Aete ::t.eA t^y A":e ':iet -o;t i :)n ani.t (I'^i) to tr.e narnoer 
■ :A l '-*.t';i a -jL Tr.e .-ani a'.' a':A i o^- {ij^ri' :e t r^e te.:AeA -ji th-; reft r-en/e amino 
aoiA 5^7uenoe. 

t'oS / AliVAATAGE - Lb---a t^j -:';..ilaare tn-E' similarity betv;een an anAno acid 
.o.'qa';.'r:o- f: i^e tes*o^A ■-^ri:! i retere:\-:-e aiiAn:;) a :A d ;: ::rueriC':' . It: lo :':e::e3sary 
thie Lieve A:'}:ment tA r.eti in-.: , e*a:a Frooi tne Li^"a;:. inf Sirmat i -E^n autn as 
i:^tur^. , function/ . i t. : :' protein ::an :> foan :!. ]n 
^r. -rj^anroe, vrr^ere tn-- -irans' -a':'oi .-e'"rt;en':-e :■:■ r^e te.-^:e':i is ^'Xfaesss-d by 
leoters* "ABCBjAB" and the referenoe amino atid 3e5U'::noe is "Bi-CABA'A The 
oeteoti jri an.- 1 aete':;^s tne numrier :'i tne A:>nqes.t t;':jrjTLon letters wnioh is 
" -t " /* ar.o tTi'i J oa..l i3t \ ■". r.. ar. \ t :"" a i :au . a^ es t r.e Tiat i.s : / c " / ) t :> oPie 

n.:mr'^:r ■:■! A^t'-i^rs -:A tt.-: am: r.-j ■■..oil :: ■E:':(oen. 7-:- : ^' ::^s ^o^otG':! srs:; -iA ^ ( ^= 4 - 
•A r.r:, M.- naivbei 'id i-tier's oi. ' w- leleren:;^ -^mivii .:: :A ^d ;-.'rq..i>rr. . Tair^, 
*:n'- S-ra:i.Mty c 1"- •.•'■il^^tea .^■;.;-ioii :\a ': 'S n r^urr:):-' r:?r':':'^:'S ,z .i n j 
if.ecnani sm . 
[M.vq. La, b/1 
BA '7? I EFI 
FA AB; GI 

VX: CFI: B04-B04A1; Bll-CO":; B12-K04A; D05-H09; 

bOt-HL'; J04-B01 
EFA : \L(}'S-v:AAW'^; 305-iAA9; TOl-JfA'A 

LI 2 3 ALA-WER 01 (jF 7 3 WPIX COE'VRIoHT 2n01 BEF.WENT INFijAMATLsN LLC' 

All 1993-30704 fr". [34] WPL-: 

nilN N1 993-304334 \A\r (31 99 3- 1 A- 9 A9 

TI Aj'ialysis oi irhrcmc'SC'me shaf'e dtrtenrLine main axis oi syiuuetry - by 

f^r^rmim:! carved main axis fr^i^m rui(d r'cAnts cA 2 p'ea< pS'Sitic-ns ot cl^-nsioy in 
iraaqe . 

Li; &a4 ?31 ;^tA3 TOl 

EA (?:AGG) K7^T PF3 INST METALS; CU KPO KIK(jK GtiR? 
CYC 1 

I' I J F' 'J 5 1 :3 i"i 3 33- A I 9 9 3 'J 7 3 3 ( L9 9 4 ) * II }:■ ( 0 1 11 0 3 : - 3. 'J 

AF'T AP tAA3iA-A?^ A JP 1 99 1- L:'' 09 3 2 L''U\7'')3 

PFAI CP 19^1- 1909^:2 1 99 107 a 5 

a; LtH GMli:0"A'-nj 

L33 A^"ABO:0-00; G06F015-62; G06F015-70 

AB :P 031":n3:A A UPAB: 19'>311L9 

I r. the .::r:alys^s a <:'arv'-d mam .-ixis is c-fctaineo by ■ nnect 1 n^::! the middle 
points 2 p. eak pujirits sA 'i^en;/. ity -/a L::e cA pictar^- elements a.'cna each of 
I^r.es f.'O rp>e-nd i-Cu lar ts- .j linear maii'i axi s :A an imiioe of cAirciitS soirie . 

t'SE 'APiVANTAGE - tisoo t :. dotern.ine tne main axis cA a <::hr c^ro s >: me when 
L'denti fying -.Thrc^mosc-ne . Wner.. i'lent i : y i ng ch.r omc'S'jm'.;- , an axis c- f symmetry 
{main ax.is) that seiiara^. e s ymm-^t r i c-a 1 1 y a cnrsmcisc-irie into 2 bltoks is 
deterrrAned, anid, then, r'ari'jus •^irirAlysoz {e.g., detecAicin of. centrcmerei 
are mad-.-. Tnerefore, it is --ery inx-L-rt ant tc determAne prC'perly the main 
axis .in the i dent i f .i ^jat .v ■: n lA :nrO'muS'jmes . 

In an examf'le, a Linear main axis is tbtained c: n an image ef 
thromoS'.-r^ie by the cc nven t i-ona 1 technigae; a density profile is 
taken al^.^ng a line (:<) perpendi tular tC' the linear miain axis; in the graph 



indicatiii'^ th^i aeri^^xLy profile, ^ ^^eak value:;. (pi,p2; ot 'ieriSity 
are selected; trie riiidfjlle poirit cr rre r poir:t '::d,::>2) or. ^:h'/ >:-ax:.s i :] 

r-b^ained; the rnid':lJe p'^-^int fj) i.; plott'-d .it the crj r i f-..-.:^ :*ri'-H Mg p'Dint on 
the ima'je 'if ■ :hr': ni'j:^'.-nie ; ir. thf^^ .;an.':' pr^j :e'.lur e as i;:en t i ijrie'i ate-'/e, tl^ri' 
l:nes (x) pe r'penai a tC' the niiir, axi.:: {..il) ar-j t^Ker. ;it difi-i-rent 
P'.'S It i c n.? , th- mi'iLtl^- p':ir:ts (q) are ob^. ■i iueri , and they :i r e i.d j f; ed at the 
-'r resp-~ r:d i rivi r» j5 i " i r..: or. th<:' li'i^C'E- 'jf ::r. r-oni'i'SOMie; cir:':i, r^y ^'C'r.r.e^: t i Tia thie 
n.ddie iL'C'irit-:; (g) -jr^t a l r.ed, a ::ai^ve<:i inain -ixis f :;2 v/ar: h- r.ijt kilned. Tnas, 
a curve'': main axis •r.-iT: r^e prcp"':- r : y '"'t'*: a i ned '--ven f'.)r a ch r' jiri':' s-jnitj having 2 
jrm:- or. ij'"">th ri':;-:-.:. the ■sers. r-^"'ri lu ( ) . 

dF r EPl td-lr'I 



M'"' ^:rr: B04-B04A1; ^ii-'-;-; : 



L123 AhSv;H:^. ^d:, 2-F I w-i:-: t- 

AM i'/;h--i'r^v:d/ i [.,4 1 wrih 

DNC CI 9 '-i3- Oh 7 3 8 4 

TI 1 -^ct i. ri^; ■: i i'M'-r.U'Ed '■■.■L't i i:i 



i d':;nt :. V i r.Q j 1^=' I^:- 



pra, vKK'Tpni': 



g^vn^/ - r-y ch-: '..-s in ;i the iri n. riurr.h--.' r c- f ftrot'es ne'Sessai y f'l r unequ iv<ocaL 
di S'l^r 1 mi nat i'l n fr^jiu niUtant t oiiLS <:■!: C'jn;i-er..;:us 3'E":}u^!n<:>:' . 



Id 
E'A 

rwr 

FI 



Fl'T 

PF.AI 

RFP 

IC 



Fa 
FA 



Ft/Ad;LLFFFT, h; rr.]-\^ 
(FFFLd FFRTIN t'Ji:, 



!-,W : A': 
W : A' 
Fi' t4tr^-:-" 



AL'T v;t,- 



7 3 Ci 1 4 ' 



Al 1 22 - 1 ^ (1 ^^a : .■. 4 ) ' 

-'F '"H f'F T'?' F? !'1F ta-; ll p 

:a hh FF 

Al 1 ( 1 ■:^'-^F i F-. 

L':? (""J r,n' r-,1.- I.- o c^::- ("-.r.^ r 

Al i2.;'FAdi ; 1 

A 1 ,:*>22t".2::. ( 1 ;^a pp i 
=1 W 1 :^ ^2a^ 1 (j ( L aa: i 
Al Wtj rv^2-FFdl4 1 ]2';^rKF;3; 



1 - 



1 . .^2 ■ i ~ 

ci.jjL : 1- 

dlh'jL ■;■ I- 
: A I. FF I 



1^^'^^2 120^; 



:6b4683 Al FF r2,;il-14F2t i;01l2a4; At 



tri' 



-4 r.7 

A AM 1FF3- 33550 
H-F;4 1F-;^212 03 



1144 - W Baaed en W(J :d KiF^. 
2 ; Wt' 8 t" 4 7 7 ': ; W(2 ^ 2 112 4 7; VJij 



r;^22 120 2; FP n72(2144a W WC' 1 '2 a2- 1 1 4 1 12-j1^j2, 
Ae ;v:.22-22i:i A basea t:n W(j Jr t- / 

FR 1991-14991:. Ia9il204 
3.Jnl.F-f; EP I029*'t'; LP 2 27 262; FF 412 
9222117; V^2' 9jlt5.:9; WC) 9211rt:9; t.Jnl.Feh 
I CM C12C001-C,8 

irz ro-}p;)21-00; 2-12n012-ll; G06F015-20 

92 11 62 A ijr AF : 1 9 :■ 1 1 1 6 
PrC'-seso tor select inig, trc-m a set ot allell,- seriaen ^--^i^:: c. f a pcd yrr-.C'rphi c 
■::en- (Ptd/ at least or.o mutatisn scrreen able to sr-j :lf y at least one 
I'l i''4C'nu::." Leot i'"ie p^r-jr^e tdsr' di st irigal shing hietv/een a^l alleles, i-orr.p'rises : 
(1) 3el»"rct in."; all K'^r pi.^rt ot a ra-pjwn C'jnsetisas sea:uen.:'e (CO ot PG; (2) 



ducir.g a ncitrix ot mutations vjith segaen-iies rresr^: nci m;! 



K.nown 



alleles; (2) iden^ilyir.g ind i see rnirl e segaer-ces tiy pairv;ise :: :.mpar i ss-n 
(i.e. alleles havir:g tne same iriutur^t i .jn profile in CFd and 
ex'^duding -tne memb--r oi eacn pa^r; (4) blent ifyir.; jnd nairberln';! 
■old Igat r-ry ma*, a t i a ltd 'marker' irutat i-Dns i i. . ^a tn^is-r v;t. Ich ^ re 
n--::essary an-i sufficient t di.s t ir:':rui sh 2 jther-'wise i '.ien*: i '^al all^-les; set 
2' of t-td. ig 5;t'jry mu^ t i tcis ) , an'-:i ■' z ) producin;: 5:t. le^:;: one minimal 
riatati'tr. arr-i-'-n jmnir is i ng at l--.jst tnese sh) 1 1 ;(a t'S r y :^iut :;i t ions . 

T-: Lient^ly allel-'S, the s -reer. selecte':! iri ttep Hi whi::h is most 
s.^itarde for prodr.. ot p^rsines tt differentiate bet\;eeri the alleles is 
■:ah'jser. :ind triese prot^es use'ii f-tr hyh'r id i sat i jn typung ot the silleles. 

CFF/AoVA:JTAGE - This meth:.'.i p.rjvi^les rapid and rell5:ble 



1 aent t f 



it 1 :>n 



e 1 



es 



:h.:'at requiring s large 



■ f orjbes 



It can 



Loentify homo^yg-st i d-jublets anii oi f f erent late them from heterozygot ic 

doublets. 

Fv;g . j-0 

CPI 

AB 

CPI: B04-B04A1; B1F-F04; D02-k:^; F02-H12 



A P F. Q E r 5 4 9 8 1 A U PAP : 1 '--^ 1 1 1 6 

Svlect-i:^, iroin a set allelic sequ-nces oi pr, : yniorphic gen- (PG), at 

i-ast ,'r.e mut-ition scr-i-a: ab.e to spe'.:ity at l(;a^;* ■ .n^- ol ig ^nu- i eot ido 
piooe i.jr distin.:miohirw r.etwe-n ail ..lieley, comprise? (1) sei^-cMng all 

par*, a knovm r:onor.-r;::^us s-quence 'CS) cf P^; (. ) producmq a matrix 

■L 1 mut-.r i.,ns witP ooquvn:--; ..x t-r (-;-:p , kncivn aLIcl^-; f ) LderUi^winq 

ir:dLsc^rnib:e sequence- c • • ^ -ii rwi ,:e ::nip:iriron fi.e, .Jl^ieo ha-inq the 
.■•o:rie mat: a : r...n profile m ■ 'P ) ::riq -rxc P.jdi no ne :nenib( : ea :n 
^■Jir; in) ^v^eat i L yi ng and nuMtc^-r:: irr.: or: 1 i irj *: -■ r'^/ iQot a^- i -t;.^ ard '^^i^l'^-r' 
r:i-iti:-;n3 (i.^. -hcse whP-n ar- n-:-e:n::3ry 3 nd i f i . : . n* to ii3^in^ur^h 2 
■-■^herwi-e Loentioal al^.b:;o; s^r or .:.t 1 1 ^::. t .m y raat. a I ...n:? ) , and - ) 
r:::iu-:n;j i*: ieaat: r-ne i n in -i : rn-: ;:::r.:c:. ocr n : .oi n g at ^e .st th(-ie 

:i--ip^':'ry ir.uT.at_cn-. To :oent:ity alLele.?, ^h^ oCieen ^..-l^c^ed in .:te^^^ {b) 
v:h. ■:h lo i.ivxit sujLtaPle [..a: j:r.:..;r.. oi: i:.ro;o^o oi t te- r en : rat e l::»erw^^cn -he 
.l.el.~ 11- r-ho-sen and t>..-;-., r:-r--re-: aoe:i for h-,T r i ::r;. :.it r tvomo rd the 

r.J 1 1 e 1 ^ . - ^ - 

•d:p/A[)VAi;TAoP - Phj:: nethrd :::ro-i-ie- r,.p..d and lol-aPle 
rdenril loation of alleles wrnhont leauir.nq a larqts aurrPer :f f^roP'Os. it 
oan iderP.:fy nc mozy ;g':t ic d^-ajPlfr* s anc ■iiitereritiate 'hem ''r'n 
heterozyq-:-ric donblets . 
Pwq . 0 / 10 

ARPQ FT; :n:"4P:-B A UPAP: 1P^3::PP 

T r--' >C*^ y ^. . •,-.f.r. I f-:- ■ /M cO .-^ - j- 1 - . -,"'"1 V' ^,--,+ , ^ , 1.^. 

■ ' - - ..4-. . _ .. ^ . ...... ^ . ..1. a^^ pdj. L. '..-^ ci ra.-i/jn ( vat i;:-. ero'^-i.j sequence 

;^ ' '-^J''-^' pr:ducinq .:i mat:-!:.: ot natations V7i tn aequen<;a;s r-jr resp. to 

':n.-on aibi.oa; (P; 1 i-er t: : i; y Ir.g . n^.nax. e rn rble seijuenz-^a ry p/.orwise 
V':Tr.pa2 is-'.n ;i .e, alleles having the same n.utation profile iu CS) 

excluding one member c- 1 -aon f^air; {4) 1 :ien: 1 1 y i :.:| fir.i numb-ering 
:. r: - iga t:a-y mutations an^;; '::i.^ik-i:' mutari'ims ! i . -i. . rn..:;e wrii-;:-h are 
neoess£iry hnd suffioieni r.r. :jis t in :|ui sn 1 o:nerv;ise o:ientic.^J al ele^; =^e^ 

0 ■: f ODl.Miatoi y mu-ations) . .n:l {':■) f r ..oi .001 : : ^ :it. :.oa:- min. al 
:;i J t at : O'-u'een CL:r[o:ud 3 i r:o .it lea;:^^; rp.-s-- r ]- I i p t- ^ ■■ r ^ 

UPP/AL:VAnTAGP - ^ Tn i s ' m.. t n-: :j f;r:.vi ies laoid ana' : c 1 labl-^'^ " 

1 .^-nt 1 f 1 :at icn ct aliel'i'O Wx*:h.:ojt requiring a Uru^ t:o. .^f probes. It can 
Identify htmozygotic doublets ani di f f er^n: 1 .it e them from het erozyqoti c 
douh'let s . 

Pw.j.0/10 

L1P3 A:JPWEP 63 OF 73 WPIX COPYPIGHT 2001 PEFV^E:JT I Id- ^PMATIOP LTD 
AN 19P3-02 7P10 [03] WPIX 
Dim tjl:.H93-0P08Pl 

Tl Fuzzy interference appts . e.g. tP.^r tunnel trenching r y shield machine - 
multiplies obi ivescenco coefft. oy memt.ership function 

or conclusion portion ot each operating linguistic cirdiro:! rule, with 
coefft. oete:rmined by fuzzy interference to vary Vveiiiht of importance of 
given rule. 
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Tne ^pr-ts. includes an operatm^: --riitle fuzzy inier.:.nc^. pc^rt^tn -dO) 
which receives input values, ^ . -r . , a "i-viated dist-nces PI and XP m 
horizontal and vertical directions retween 3 plann-d orenchmg line and an 
actual dire-tion or advan:e of tne shield mscnine. A function 
iv'^'^'^u^^^"^'^''^' " '"'^ ^ str.iPe quantity :: f each sh.^eli macnine jack 

t.hriugh fuz^y ^nfeienc-. Tne p-:rti:n comprises a linguistic fuzzy control 
rule storage portim < operating rule storage portiin :i) .a membership 
function storage pirticn an col I'/escence coefft. storage 

portion (3; anl a fuzzy inference mechanism (4). 

The operating rule stc ra-^e -cntams a num.ber e.g. n Liriguistic 
contrsl rules, which recei-e XI and X.:. as input values an-i provice the 
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'zroKe quantities Yl, Y/, , r^-spect i vel y , tor interred -values ( f.::oncluded 

'at-.:e3) oupplie':! t C: ] a >: p-'^rtion, of. ^r^e ,;:riieL':i r:;a':r. i:.e . 
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Al t'l- ex EP 1 -AiY- ;^I 4 J.AJ I9IA.061I, EP 



Y3; EP 5Y00YY Al Bas-.^d on WI' Y2Y2Y';:; JP 
; Ad r75:d6 B Previens Pnbl . AU -IaA:509, 



e;: y.-^ 'A: Al a;i -^4 04'm 
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I : AT^ BE Cl\ EE ! A i 

1 --aAAA t; EI I ^YYYI j,\ 
C t: t't Y 'A. Y7 W Vv'I 
B Ad A-^-^^-2.:YI 19 -ot';-!:-; 
] 9 99 - I ii ^-.j 12 1 9 .A; 06 1 
A A 92 22 00 Y A B.jsed •.■n W'l 
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A rtieth'-.-u for niaY.in':i a hun.-ai: i sod ant ilj<:o:y <-:(jn.prio i n an ani:.no <ioi'"i 3equ*E-nce 
■:A a n'i'n-horian, imf. -'rt jntifc'joy .jnd a huiran antit'oYy is calmed ';:ijmpr i sing 
(a) tt't ui n ing the arumt) .:i.;:d so-guenoe:? " ol az loatt a por*:-^on r-t .":in impicirt 
v.":iriablo re''ii':.in an^i c-t a '.M-nsenous nun. an v.^riable* dC'inain, (b) io^ntifying 
C'MnrileKierita r y [lot e r :ra niriu I'.egAon fCII'-) aiitir^C' 
ar.d the r.uman vari-ihAe ■i'")i[Lal:. jeAuen<:-e~, ic) 
aiulrrt a":.:d /e:fu^n:-- ror -rn- o tr r-srooi-::! r.g num. 
[■:!) ^ligriing the ana r^j aoi^-l ;-^equenoes ':d Vt< 
iriLp'art fintib'jdy and th- -o:) r r-i-^-pDndl ng EF; z: 
i :lerA:A ying iriport ;int :a:o)';iy EE reslor.e::- in 
ar e non-homologous ^.o t.no rresf 'r:G i r:g c:'jntensus 
afLt i C""j'."iy re:.oA:ljes, (f) .i'^t: erruir.ir.'^^ if tr.e non-homologous 

I:T:p.:-rt ..i]:\in:- r-r.^.ioio- :£ r-ra-Otn ii: 1 y oxpeoted ij nave at Y-as^ one or 

the f ■:! I ■.■jwing effe:-r.;y; ■. d nor.- ^--'Va I onr iy i:d.nd3 anvigen iirectiy, (li) 
intera::t.? with a Cr)P t)r (iii) p a rt i :d itea In the Va-VH interface and (g) 
fjr ar. / non-homologous mp-'rr :;n-.ii:::iy anAr.j a2i:i 

resi.d..^ wnion is t'^ json i^bl •;>xperte:i ht-iive it ].e:i.v-t ■:'ne af tnea-^ 
effecta, su:.^:-; t it ut i ng that r-sioi^e lA r thie tor reapondl r:g arnln:.' aold 
r^F'Sidue :n *■ ne car.ter.sus -lintib^dy EF. seguerice, 

Ala.' Claimed -'A) a r;etnod ivapricing :a; proviim;: ut least 

p'trtian ■■■>£ .-in ImO'tr^. , n jn-nur^Lari ir.t i;:-:' dy vari.aoie dirriain amitrO acid 
sequen::e ha'.d.ng a ■YYr: and i EE, b) ;t'taining the avin:' a:: id segue nee of 
at least a portion if a c:;nsensus human antibody variable d:.m.ain having a 



aci-d equences iri th^i- imp'Cirt 
e.ut:).:; t - 1 ut i ng an iTLp).:o-t CP?, 
an CIA\ ainin'j aoiid s-'-juencf^, 
amevycad: F.^-giiin { EFC <jf the 
■:Mn.} ens us ant i ti'j'Iy . ( e ) 
allunea EE seai:en::e3 that 



■":DR an':i a FF, (c) substitutin-g the non-num-iin ■::)P for the human ^:L'F in the 
::'n3en3U3 human antib-^dy ^^ariabie ijiriairi and ((i; s ....:.bs t i t ut in^: an amino 
a-:'i'i resifiu-- tor the ^-ons^-nsuo amm: aci.d r'esLdue :it at least cne :f the 
f.iJicwinj sites: 4L, :oL, ?6L, 33L, -FSh, 4-;L, 46L, bSL, h2L, 63F, u4L, 
65L, t56L, h/L, u8L, ^::-^L, 'J'jL, 7]h, 7oL, B^h, -r/L, 9SL, 21i, 4H, .MH, 36H, 
1'7H, ^'^H, 4 -.H, 45H, 4 Mi, — H, -eH, -'/h, f3h, u ^H, 7 )1], 7:-'H, 74H, /7H, 76H, 
7'^H, MIH, 'U'H, :'3H anc: 1C--H; (Bj a hurnafii 3-d antibo-:ly variable ■.iC'T.jin 
h r/mq a ri'-in-human CoF in: ■■jrp':- roit ed ^nt3 a hum.ciu antibody variable cicumain, 
v;[:ere the iriprovemenv ocmr risea auo.: t i t :t ioig an ami:H.' ac^d realdue tor the 
111. man lesr'ivje at a site s--leoted 1 1 tn-:3e detine-i :; n (A;/ (d) ci 
polypeptide C'tmprisirK: the amin-j a':-id seau^i-nte (I) or (IT), (o} a 
metnDd for -..n jmee rtr.q a r.:m3ni.-;ea ^ritiu:'i;/ oompri3i.n.q :a3: rod .c.n j um:.no 
.a'":id residues frtm a:\ imp::t:t in*:ib:uy var-,^bl*^ oomai.n ii:t ^ an ar^nno) aoid 
sequence rer resent in a C':-n3ei;3 js -it mam:i^.illan antib-'dy \- i.a;: ; do^marn 
aequen .:*;^s ; lE; a C':int'\:ter : jmp r i s i n j th^:r s * ;'qaer,0('! la^a ot tne ainino aci<:i 
oequenoe 'III; or (XV;. (Fj -i otmputer r -rro-esent ^a t i. ^t tl' -.h-> .■^]n..:n ) aoioi 
■■equen'jes ^III) and \ iV i ; {o; a netno:! otrpirismg st.rrnq a co-m|-'Uter 
representation of the aminO' aoid s--:-quenoeh {111) and (IV). 

VoE/AoVA!]TAGE - The ::ei:h';>l3 allow t:he ra|.)id hum ^nos^it ion o: 
aritibodies and lequire less OUA synthesis *:han pr-evi-r-us m.ethods. Tfie 
humanised antibodies have desired b^ndmq and (.■ther thairao ter i st ios and 
:i'.:ti -/i ties tut are less antiqenic : ri human;-; than nt)n-hunan an^ it.o lies . 
!u;q..;/ u 
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Ad 1^^91-3-5101 [47] VJt-lX 

DdN ;n 'Hti-iQ -.f-i-t. one v:! - ^i-i. 1.12 ■ 

Id 'lorrection of algtud^thm f-r identilu^caticn cif bitd^qical sample - involves 
■iM'mpar in -J p-arameters with stan^-iard values, correct. nq membership 
characteristic function arid identi.tyinq type of c^rqariism, 

DC 81)4 [lib JC)4 

E'A (tiMFO) OMF.Ot; CORP 

CYC 1 

PI JF 04283870 A 19921008 (19924 7)-^ Fp G0oFtl5-62 

Al'T VP 04283876 A JP 199I-48I18 19310313 
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IC ICM G06F015-62 

I'':S G01U033-4&; G0IU033-4 9 
ICA C7O6FOO9-44 

AB JP 04283876 A UPAB: 19931116 

Bicilt-gical samples including several orgar.isms of tv;c or more typ'es are 
driea and stained after applicat it>n, the images of :ne c^rranisns are input 
as irriage signals, charac^t e r^ist 1 parameters f i^r ea :h cugar.ism are 'tt'mp'Uted 
from the image signals, ani th^ tyi'ts cf the Mganisms are identified 
using the characteristic parameters. The ch^r hcter ist ic p^arameters obtd. 
from the t-.i o l<-.gi ca 1 sampl*^s ^re compared v.d t n the characteristic 
pi.i rair.ete rs (standard v=ilues) used tor thie determn. ot Ideriti f icat j ■: n 
algoi ithm. Ac'tiorii irig ta tte v.^ r 1 .-a 1 1 'jn , t. he iterth^e r'sn ip- c-na i"ac te r is *. i 
function of the ident i f i ca 1 1 c^n 3lgC'rithm is C'lrrec^.ed, an:l the 
tyt'es of the organisms are i':ierJ: 1 f 1 ed usir.;; trie correued a 1 go r i t .hiu . 
U3E/AFVA1ITAGE - Used to oorrecr. the i d-z-n- i f icat i on aIgori.thr[. in 
Hssi float ion of the types 'Organisms existing m 3 Di'-'l^jg i.cai sample 
usin-:j tne characteristic taratieters based up'On tne mo rph j logi':.:a I / density 
t;cJ-:.ur) characteristics of janisms (e.:i., tell .o:jrting, ieMriocyte 
Si-'tting, etc.). Tne membership' chara rter ist i. c function of 

identification algorithm is ■:.t:'Mectea acctiair.g t ci the variation r-:-tv/een 

the characteristic p-arameters ■■-btd. frcm b 1:1^-0133 1 £3mp>le anci th^jse used 

fcjr determining identification algc'rithm, even if the charact '■■ristic 

p<irameters vary, classif icatic'n can he mia ie with C'jmt arat i vely gc-O'd 

ac suracy . 
■■■ / c. 
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.BJ402; AB '.^2 19373 A Ai.i 1992- 19878 
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IBB G Cd ! J ij 3 3 - 3 5 7 ; G 1 M C' 3 3 - 5 ; ' 3 ' ) ] m 0 }; , - 5 ; ■ 1 1 B ' Oi ^ - ; GO 6 FO 1 5 - 4 0 
; G06F017-30 

AB W'-' f'^ 1 7 78 4 A ;iFAB: Bi'iiOllOf. 

Ei'd-rnn. <: f the analyte oc^ir.psn. or ^ s^nrile ■■;.:o.t;. -J least, one memfc-er 

ar,aB/te cd a or-r^ss-reaot ive set ■: : analy:es Mupri s-'o- : (a) assessing the 

s.-^rTLr lo \;.r.*:. e^^^on ol n in: :.'iTi. u t . :.-n idianriels in x o.uKd 3:rdg. n 

inf.: rnr.at: 1 :n cnannols foi oB r j ri s t i s paran-E-t. e rs the nemt^ers of the 

S-: : ^^Bn^in a survey of .Ji.n r ao* er i st ics ;EB'7. profile; ar:d (kj 

C'-.':tit ar 1 ri^:j the 32' profile 'wit.n r-fereno^ sev ■■■i.' 

profiles C'Btd. frim ■;-j:r.r. on.- . -jf the :ae::.::ers of the 

crc^ss- reaot ive set. 

UBE/AB7ABTA3E - Analysis of samples for re--:oanlsi ng classes cf 
anaiytes, e.g., herl 1 c ides , :n oiinical an<. lysis of Biclogical sarr^ples 
(e.--^., sterol'. is, protein h'Srifion^-.'S or their rr-rt ab'Vl loes ] , 

aitificiai materials designed f^u" therapeu*:!: use (e.g., 1 eta-1: lookers , 
cheK._;ther af-Butic agents, prostaglandin, ml'd:. i ts^rs , etc.). Illicit drugs 



":iy be detected arid distingLiiyiie i for forerioic purp'-s<^s. 
[ wa. lA '3 

Fs :;pi EE-: 

FA AB; GI; LCU 

'.'PI: B(N]-L-;OPC'; C04-E0JD; B04-B0.:F; C0-1-B0:^E; -BO-1 ; C:)-! -F D^C^) ; 
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IS immunologically cress reactive with at least :::.ne corr.p ^ementar Ity 
aetermining region (GBR) c^f at least one antioody (At:), the GBR having" a 
plurality of hydroxyl gp.s . positioned m three-dimensioanl space, where 
(I) corfiprises the same r.umber of ny'ir.zxyl gps . as the CBR and there exists 
ar least one conf orma t i cti of tne synthetic cpd. where the 3-0 posi::ioning 
of it:s hydroxyl gps. is identical to ten 3-l> positioning of tt:e hyirc>xyl 
qp'S. ot tne GDR. The synth-^tic cpa , (I) ].s of f.Drmula: where Bl, R2 and R3 
- chemical functional gps. each i^ompi i s ina at least 'jne hydrcoiyl 
gp. 

USE/A[iVAIITAi3E - (I') may he included in cc^rnpsns. for adnan. to 
mammils to elicit a prc-pny 1 acti or t her iipe j::ic im:nane response. Methods 
ai'-^' provided for the prepn. of synthetic cpds . which are immunologically 
cross reactive with individual At- GCT3. These cpds. nay t^e used in 
studying the role c.f a GCG' chenical functionality to Ao or GL'B 
astivit.y e.g. in s:.u:iying the i :de ■: B sp*;-citic a:nin:,.ic:id resioues in the 
interaction :.f the mmoilor.al AB 87. 93.6 light chain varidtde region wtih 
the re:.virus type : recertcr. C-yj.^ s nz . cim.prising re.ativley small 
synthetic ^pds. may te Jevelcped tc meiiate the 3G:d.gical effects cf Abs 
cr other ligands. These sc is . sh:uli py^^zes- benefB^ial properties such as 
iri creased nail -Life, -he ardlity tc cross the blc co-r^r ain barrier, and 
la:r: of immuncgenicit y ani thus snculd F^e useful fir the cevelopment of 
new p'harma ::eut i ::ai , therapeutic, diagncstic and reseptsr F inding aaents. 
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Mf.c r .'^ . ."-T^jpr Lse;-.^ a genome cnar.:x2~'iL secuen.>j a:ialy;-j'rr with a 
win-jr-w function un.it t^:) c.'.:an a character s-rquence. Phis 
o^^':-:! f ios Fho genome to trierel:y ex^r^ict th-? characFer sequence. 
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Ii C'btaariinq signals represent mq 'ii.tterent sc-ecies m analysis - using 

ratios or signals obtd. m .-JifPerent mannei an.ri reiioting ■iirferently to 
SF^eoies, used to obtain F!IA sequence mf 'Drmatiion . 
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Two signals whicn rea :t different ly to cifrerent species, fcllowmg 
species separation in tine or space, are generated, each signal varying in 
an^plitude in respective senses to form peaks as a function of 



Che Jidlui- o: the .species, ea:n peak being repi e- ent able by the sum r-f 
thr-e or :rijre p.eak teiTifiate: . A template i- dei iveu f- r fattma to e..ch of 
til-;- ::,ignai:: t -) oot....:n th.- p-ar: an.nlitjde oi the -igna.s ana a :'hird -ignai 
Lo ■■ierived, rrcm tn-? rati^j r.f a function oi the i:.eak3 of the 
r.:ror and seo^i-nd Si-.uials, the am|:ditule t the tnird ;.ignal being 
i.rio:(0:itive the oJentit:;^' t f eacn ,: f the £;peoie.:. 

tioE/ AoVAdTAGE - ?ar^:.lc. in cbtaining [/NA s eo^u-oi^-t ■ i n f o^rn a t ion . 
Allows tne unaerlyinj peaK shape to be pericdioally rodetermin^-d and 
^J^-^'\to tne dat:^_2n a way^ th.- yield:? aooutate latu^, even when the 
pie-.-ij-e -iijOvition :-r the peal: i:i nii't knc'wr.. Tne metn^jd ccntairis s^^-lf 
oonsistency checris r.o that ji'Otenti ally erroneous ratics, ana honoe base 
^; t '--rir m.ro: -^ns mn.o/ i^^i !0:(:ooi: . ;o-- . 
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Oeternm. of ElIA sequenres comprises generating signal t>y a sequencer that 
varies in a-if.1 i tu-ie to form peaks as a function of the nature \:d: 
■ ne speci^;s arid q-r.^rntjng seC'inci si.^r.al. varyLn:( in anplitude in a seoc-nd 
.:-:rise .liffej -rit. f rr v tn- first. A frattiori ti th^ tirs^ si:inal is 
s-ibtractec i r om t.be seo".'r::i sigr.al, fr-i-m. t .he peak cent:r'L- cf'tne first p*^ak 
^ pt -a f: centr-^ ti.e seoond ^;ian::i peak in ea::h peaK sequence, so 

amplitude ot peaks m e.^ch difference is zer^:, :ri vmq a prot r: t yci tal 
^^::mpl-te f::u tne i-e -A: . The template is fi:te<; rr. tne" first and seoond 

tc oiv.- a thAci s i jnal iniif-u^ive jf tne ilontity of each of the 

Af/ANTAoE - imficveo ico^ii^cy :: seo..en:-e . n f .j r :ua : it n , 
ABEQ EF -lo?:?: r bf AF : 1 w bAB-: 

A methDO of ident:. fyin; the sequence if b-^---. m a cartirulai Di^A fraament 
by means of signals characteristic of reporter rrtAecules associated with a 
particular base the methoo including (a) electrophoret ical ly separating 
fragments m rime and/or space, (b) for the particular fragment, producing 
an emission spectra which is passed through sepfara^.e filters having 
aifrerent characteristics to generate a first and seoond signal, each 
signal varying m amplitude as a function of the reporters 

associated with the bases m tne fragment, to fDrm pear:s which correspond 
to a particular reporter associated with a particular base, any peak of 
first and second signal being approximately representable bv the sum of 
three or more peak temp.lates, and (c) obtaining a third signal 
corresponding to the ratio of the first and second signals, tne am.plitude 
t'f the third signal being indicative of the ilentity of each of the teases, 

characterised m that the third signal is c.btained by tne steps: 
finding a sequence of peaks AYA in each of the first and second signals 
wnere A represents .me h^ase and Y represents anotner base, subtracting 
from the peak centre of the first A to the peak centre of the second X in 
each signal such that the ampdituJe the X peaks in the differente is 
substantially nero, thereby c.brainLrp:j a prot yp i ca 1 template for the peak 
ot^ each^ signal, and fitting th-- respiect i ve prtootypical template to the 
respective first and s-cor.d siA^^al spectrums to 'totam the peak amplitude 
of the first and secono signals f ^r ':.b^aining the thir.i si:jnal from the 
r a *: i o o f s u c h p e a k .a mp L i t u e s . 
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r d i l e; ■ r. J re: t 'j S't' :-':'e^. :~. i v^■• .-"•eor.r.ing pc s l t i t ns . Tn^i- u:^er niey credte custom 
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r " jr.ni:.-} ■ t axal;^-s:.:- function, z^oi-^= i r.: p re: ei t^.r ceirain 
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A iata Lir'''e(^s sin ;t sy;:.*:en -r'lir.pi x ses 3. '?.':-mp'Ut er , an imag^e S'l: inner and a 
natural ^jricujue interfaue vspc nsi v-:' tt; user i nitu: . Tne system creates 
ari-i exe-^u^-T-s. a oust':-m ap'p'l i '-a 1 1 c-n pr^jaranmie v^Ltn a se'iiuen'.^-r of 
l^ser -sc>e':' i L G ed so f tx.v,":ir t^ tt'<.ds tor soarining an analysirig^ an image with four 
vert ioai s-vct i -jns , eaiGn witii n-: tr s-ime hijrizjntal P>and3 witn nc'ri- uni torm 
p- r- imet ei s . 

T'j.jA: are stor-o. f<tr i:: : tet-'t i ng scanning and an,a.lysis .-inio custtm 
prtigramme is created by sttrinu a user-defineu sequence tf t<j<jis using the 
r.atural ^ 'irirjuaae i':l-u"jt i iter s , ."^nd the p rt-gram^me is execute'.! t it scan the 
i]Tiage anri st<tre :he data, us:ing the -data zo Ljcate the horizccitai P-ands 
ar.'i t't ci-'r relate ea-.-h of tn^-se tt' a p redeternined shap>e. 

tG"E AL'ttAGTAGE - ^-arti'^. ftr analysing jNA aut 'tra'.li : ^i' im^, fC't general 
iruaum'T, i^v^pcGat i ons screer: i an:! dia^nt'Sis incluLiin;: AID.-" ^^'srinj, and 
mi'::r t-- 1 1 1 r e t'late r^aiiin^. Fd-t^'/ioes ni^h r esol utit>r. ar..a lysis C' f a. wide 
range o! !':.'ri::ats in an i r:t - r a':", i ve cc-mputei" envir-'i'tment . 
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IC 001IJi;9 : - /6 

lO'M (9i:!lN0 37'-'.7:; oL INO 7 7-5 ^ ; 7:0 11 17^ 7 

I C3 70 : 1 u 0 9 9-78; GO 6F0 1 9 - 0 0 
IlU 5'-^ : oiJ 

AB V;(9 '^0 0 8 92 5 A JE'AB: 19 9908^5 

(A) A nothc'd fov aetorrrAriir:'^ :f a pr':.'0'r:c:;rit wornian io ot oionif ioarit risk 

carryiTiq a fetus with ac-wr: syrnir-jmo ; L7' j is '? Ia inieo o^jmp r i sinq 
n^-asuriri'-j a p'r^rt'^nanf vM'inon's na^ '^r L/::! s^-ruru i<[iv*i'l '7f free k^et a 5u::-uriit of 
n^jman <_-hori c-r.: 17 q'jn,j«:i'77 rC'P'i n (oi7<91, iri'7ijr|;-':-rat i nq tLe mea.?ure]riorit cf the 
lovol and the joreanant v/rj^ens L^os^oat i <jr.al aqe irioc. a fi^ro'tiat'l I it y o.onsity 



function 



v;ith a sot rL-:-rirLat i '.-o dat.j t'7 rjf:-t orrr.i ne the 



p reariant womans risP -j: oarrymq a fAtus v;itri Li3 . cB) Alsi7 '.^la iiuea is a 
luotr.od tor do-".ormiMriq :r ^■■rO'^r.ant woinen is at ;:• i ari 1 f i cant riso 7f 
':-.;i:xo'iri'.? a fo^us with 7o c-imp'risinq .::5say.na a pro'irnant wr.n^ons l:)l7":-i:i for 
free i:.^--ta sut^unit -7: HO'G, the re:rul*:s oi tho oS;: ^y r)0 i nq indioativo of 
irioreased ri sl-: of fotal r'3 . Tr^*:- moth7'd may f ortrior 7:'jnL}:ir- i so assay i no a 
proqnant v/'jnions c-ic---.; f'_o a Ipha- : ^ot .[.prt)" o : n (A), ' '7 ) A1;oj ■rLaiinod is an 
assay for nLoa.our i n-;! a ijois-^ns r-^ ~)i:ol Iov-1 th-- fro^f' piota subur.i: ot hCG, 

([J AlS'j clair:iO':l is "in ap'fits. f r-7-oivinq a moasi;r€;ni*E-nt a i^reonant 
v;7riu.Tis niatorn.-il tl'jv'd It-vol o\ ' fre^.^ rj07a subur;it -jf h079 and a 07)mputer 
t7r .T'-irrp" a I'i na t. 710 in- is ironiont of t h-i- I'O/o: :o a 3-t ot roforen7:e ilata to 
-aot orrriine fot.il ■rnr-jrri::).- -^ma 1 .irinvorinal i 1 1 es . 



Ml- * 



o r 1 prO'ii7ts a nijii-T P'Oi CtO'i t i':-fo ot 



fotai r'3 .^asos with a lesser fa7.:'e p7'Sitl^'(^' rate- than cith^^r kncrwri methods. 
7i-^te':-t : toi effi.;ienoy f^r- Trisomy 21 as hijh as 83 has boon a::nieved. The 
rnr-th-'i ::an -tlss) t^e 7sed lor d-r;t-:7ting ohr 'mjsomal trist'mies su7h as 
tris7riy 13 aroi tris-oiuy 7r. 
Lwg. 'S / 1 4 

oF'i ep: 



M3 CPI: B94-B02D4; B7)4-B04:i7; B'i4-HU4L; Bll- 
D05-H09 
EE^: 3-)3-E14H; 395-7 



511-CJc; B12-K04A3; 



a q e 1 IJ 0 



ABEQ \]Z 



.■ - J ' 7 A U FA.B : 1 ''-^ ? 1 _ .7 1'"' 

determiniri'j .1 ; f ■regnant, w^r^rnan is at. significant risk of 
f-'^etus v;itn Lc-wr.':.. .ynirnnie measures thie maternal serum level 
fie*:^ f^-ra human -h...ri.:nis g^-nado t rophln and has a computer to calculate 
a set oi n-jrmative iat-.i anc :n<;-rp':.t at the measurement and the 
■;-stat:on-L :qe ir^'o n r- 1 '4 -j; i : t y :ien;:ity function, thus 
t'tn.par 1 f.._i th-- ac^cr.! r -r.-isai -r.en* s with the ncrruativ^r dat.^ set. 

computer j-i:-:. ge-tr-taTf,s ^:he function tv a linear 
■ns-iirrcn..nt an-ys— pr t-^-.-e . Me- sur ement s cf^inta-f human chorior.ic 
g :.nad'~)-^ ft phin, alj.hd-t'-t: protein i^v.i unc ^n i uqa t ed (.estiuc'^ may 
rxxs..' ti^r .rip'Ut mt.:. nne c .: it,;- i.:r ei f':c: ccrpasison v;:.th nc^rmative data. 

.7. '/;7:7:AdP - 7 SM ^tly iCMcncts . lilgncs pe]-centa^:e c-f 1 cei al Down ' - 
:-yn-i::ire Case-s wiM. . (.:wec :aj;-e rs 
L w . oi 4 

A3EQ tt' 77d,l(o"S A 7PAB: ' ^'^4 !; - 1 (; 



iigne.s pei-cent. 
:ive rate ^han 



e c-f 1 cei al Down ' s 
o n \' e n t i : r . a 1 me tnh o d s , 



L-e-ni-r:] [. rf. cjnanf. v;'-.r:ien l-.^r a [. own V? syndrc me f(se^:us ccn. prises measuring 
t h: : :i l--.-ta-H7'J ar. J n.f. a r j n^.! this and tdie woman's :Je^ 1 at i oria 1 age to 
referen .-e values :n r.^regn^^jnt wc-men with normal /Be. wn ' s syndrcme foetuses 
high-^'i- I'tvel of t.-tt i-B'-'G in^iictes a higher pi oh-ab 1 i 1 
syndr7'n:e . 

AE:-VAh^TAGE - eater p^ticentaqe foetal Dc.wn ' 
less^tr false [iiositi-.'^ r.:it-. 



■tf Lawn's 
syndrcime with a 



L 'v i g . 11 /' .1 J 
ABEQ EE ' 4 7-1^*76 B UE/;B: 1 
t\ .' ; L [\ ^' 1 ' 1 ■ ■ " c e r d 
carr:yin;i a fet^s witp 

v;.trran's td-:.oa ft. r f : -je r.:imsn rn 

r-zsuLis oi Che assay c-rina mon-nitiv 



^ " no I.-;- im.i n:..na ij. 3 p: ■;.u..c;r: t K:m.an is 
iwii syn'-jLon.e 7oirC'r is J rrj : ass.Vyd.ng a pregnant 
litni- g:naaotr:Lin {hCG} , the 
cf in:;reased risk of fetal Down 



■vop _ 
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EF' 361214 A UFA3: l7--'^0g^7 

Nr4P spectrum of each lipoprotein C'lnstituent cf hlo:d is stored as a 
reference spectrum : ar that constituent and the measured NMR sp-ectrum of a 



T'ldSKia or striuin ociUifjie jf bl'_-::-i no oe anaiysea coinpdred v;ith a 

■:-a 1 cu lat e :i linesnapf- pr- iuce-^ : 'y adcinq t 0'.:ezrie r vA\e re ter^-:i^':e spf-'-rr\:v^ 

I 3r each ::criS : i t ier.r , i:. an.t.r .:et e nra. ne'd y re^p^-ctive const: 1. 1: uent 

c3-fft-. whic:h are .jijuoted j:-.Ml the cal at eci Imoshape riatches the 

rieasur-cj lIMP spectrum ot the .o^ifipJe. The -ocwncr. . of at least one 

.1 Lr.opr-".t:ei.n constituent tn--- iol :-od samp ie can he calculated as a 

function t t.e c-^ns t i t u-nt USE/AT VANTACIE - In the 

It e.j:-.ur^;nent: ci chcdest e: in t^c^ic samples. Ccnst it cent s od: blj^^d VLDI, 

hOh, Hl-L, ana protein ri^ce 'iK'r spectral pr:peitles invariant 

frci' pM-vc:..;. person, any fe recces :n NMF. spe:trun cl hlo.xi samples 

are .iue t'^ tho- 'ixf f ereri'.;es i.n if.rvels c-f tr.e constituents. 

f '^:cr;ss 13 si:-.: :in ; reliable prcvides accurate m-oisurements 

:: : .. :p ( -pic ts-in '.ontent b^:):c:, 
4 /' 4 

FS 'VP I KM 

FA AB; dl; P'"'d 

MC cPI: PCl-Lhc; Bc4-B;4P:.; P04-L:'-4Ld; Bll-Ai^A; B12-K04A 
EPl: ^0?--K'j1; Sii3-E14H1; SO^-<:tl; S05-D02>: 

ABEQ 4933-44 A dPAB: 

The .ipc^protem constituents t-: blaoi ar':i ::.easu2~ed k y (i) storing the NMR 
sp^ectruni each 1 ip '..pr ate m ■.^■onst i t uent as a referen-e spectrum' for that 
■::(:nstit uent. ; iii) a.^cuirLr.:f an ^iMK si(jn::_ [:.rodu.::ed ty plasma or ;^erum 
s^iiT.ple of *cie bJijod r r, b- ;ir::ilysed in an spec t rc vie t e r ; fiii} p.roducina 

■ifi hMP sp---:c:rum ol the s.nmplcr ty t: rans f ::ccm ng tne siinEil; 'iv) producing a 
■t:-i ; tuiate ::: ^ir:esn5p'0' ty :T-:ci]r.'.: together t;\e re f-xeric-.- spectrum fc^r e^zh 
constituent in amits. det-rmm-c by respec.t.ive constituent coeffts.; (v) 

-^^-^ c:-effts. to fit * ne calculated lineshape to- the NMi. sp>ettrum 
jf the sariin^e; aric ivi: c ^ 1 -^u . ] t- i n 7 "hp --.-n .',f i^.^^-^t .-^r,o ^ i y- -.^ ^ 

jns 1 1 1 uent as a function "he v'alue "^r i *- =^ c-^^v-^t i ■j-=r- r- 
■t:-ef It . 

AitVAhTAdF - A ir.otr;:d l-'..:' "he ..iscurat':- ana relia;-i..e me a.3 aremen t O) f the 
lip't'prttem cc-nst i tuent s of b:..:.d .s pro:\^i:led. 
AB EQ E r' :■ 6 1 i 1 4 P c i^A B : 19 4 'J 3 ^ 

A m-thod tor measuring the lipoprotein cc;nstituent s ot bl'^'Od, the steps 
■comprising: storing the tJt^P spectrum of ea :h lipoprot^^in constituent as a 
reference spectrum for that constituent; a.tquiring the NMR spectrum of a 
plasma or serum sample or cdcod tc. be analysed; produijing a calculated 
line shape by adding together the reference spectrum for eacn constituent 
m amounts determined by respe^ttive constituent coefficients; adjusting 
the const: tuent ccefficients to fit the calculated lone shape to the NMR 
spe^rtrum of the saraple; and cab:ulating the concentration of at least one 
lipoprotein constituent as a function of the value c>f its 
o^on s t i t ue n t cc^e f f i c i ent . 
Dwg . 0 / 4 
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5275 19850419; 



198604J1; JF' 04j6r341 b Jf 2'^85-b^.:75 i ''^'ri 5(i 4 1 ^ ; JP 04062342 B JP 
-8-270 n^^5041 

FDT JP (■40^-.^-341 P B^£.e:i on JP' o]24:-36:; JP ()4CO.":34.; B B::.sei r>n JP 61243302 
PRAI JP 1 930-8 027P ] --^Si-OAl^ ; JP 1^'80-0P27(. 19:-:504 1:^ 

REP ]. . Jr.l . P-1"; A3...e90:i; EP 1^2>.)21; JP P":i^'07^:4/ :;-:^-JP . Puc^ 

ic g:):m-33-oo 

i'':^s j: ..^.MMi-r.e ; IK027-447; o' l^^J•;3-': c^; o^mTi'^a-z - GO6FO15-20 

AB EE- K^J<27 A [:FAB: 19930-.;^22 

A sign^i: P'ror'-'S r.-T metr.c-:l tor :i6^t ■:'rm: rA r. c laS'"- sr;o;oo:\ :;■£■ dP nucJoio acids 
c .rrvC r iJ'r^: :yur:- i e:t i r^q ai;it^i :^-L::nai;-: i .-i^r.al }: rr :.;.3-:- i r.o . The oigital 
Signal^- •:-.-viesp. an autv^radi -i-iir -uph ro^o:lvod rj'W;:- wOi'-h are fOirmei by 
ro5-:lvi:._: ^ rr..x*:, r^ ii.5 e - , j: vcJ : 1 c- PrJ- lia^r-n*;: r r ^O:- - .et; cc i f i c PNA 
fragmen^.: lac—llea voitn a ii :;-:-t . ^'■r 1 ■;-rT--rr. *: : ■::Jn,-rnai c^n a 1 direction 
C'n ,a suf. r 'ji- t rieoi :rr- . 

Pl'.J:n'--d : :ir.f: r : c a s [r^] d~re':':inj z". i^-a,:! 2 Linj.: in i'l'wei tart of 

e.:i rOi r^v^eJ vea re^w anc nar'oering *:h>.; ]:^:n:n- :■: ns-cj ^: ioo:d ■/ tr.ru. the i':wer 
er.:i, (ha tL^tammg re-ia t : -.-n i:-rtween trie ;.'an.i'3 naiiuter 3na a rescived 
distan-re ior ea-Th resolved rovj ^^nd ;o) ^Tiot e rinini nq <:iLfterence in the 
reaC'lvf-'-d dist^inire between the re:rcdve':i r<.:'W3 fr'..-rr. t:ne resulting cc-rreiation 
and mahmg correction for resolved fc-sitior. on edch row by takino the 
difference as locationai deviatitn of th^v rows fro:r, ea'.:h .^ther . 

UJE/'AL^VA: jTAGE - The baa--- s-:'::|uence ol: tne nu :^e.i. .:: ucids can t:e simpiv 
deterrn.:neu w:J h high accuracy by p r c-cess i n:"? ^h- di. ].:J:a. S LCjnais through a 
sj.gnai pr i.-cessiriig oirci.iLt hiavdai : a function '^af :ibi'.r jf rr;:^k:.na 
the cc:a' r e^'C; LOT; tor I'jirset distortion ,vhen oho ]'os :;l\^e(3 rattern causes 

the offset distorti";n. 
0/6 

PS '::pi eft 

FA AB 

MC oPI: B04-B04A1; B11-J07B5; B12-K04E; PJ2-H1^; iJ)3-B; 

f:oo-E 

bp: : SCJS-E14H 

ABEQ JE 3(:m:.'^ 030 o UPAb: 19030^02 

A signai processing rr.ethod ft.r 'ileterrnining base sequence of nucleic acids 
comprises subiecting drgitai signais to signal prc-.:essinvT . The digitai 
signals cc:rresp. to an autoradi-:) ^ic^h of resolved rows which are formed by 
resolving a mixt. of oase-speci r :.c f'tlA fragments or oase-sciecif ic PNA 
fragments labelled with a raaioa',-:t i ve element in one-^iimens i Dnal direction 
on a sup'P'Ort medium. 

Method comprises (a) detecting at least 2 ban is in the lower r-art of 
each resolved rc-w and numbering the bands ct nsecut i ve y fr.tm the lower 
end, (b) ot.taining correlation oetween tne b.md's nrmber and h resolved 
distance for eacn resolved row and (c) determining 'ii ff erence in the 
resolved distance between the resolved r-cws from the resulting correl.^tion 
and making correction for resolvel position on Or.ch r-jw oy taking the 
difference as locationai deviation ■: f the rc^ws fromeich other. 

JSE/ADVAtlTAGE - The base sequence of tne riuciei.c .acias -an be simply 
■:ietermin-d with nign accuracy t.y processing tne ■rli'^ita: signals tnrough a 
si'ijnal prtcessing cirtuit having a function capar-1-/ 0:" maninti 
the correction for ■tffset tli stcu't it-n even wb;-;n the r •.■solved oattern ''tauses 
tne offset di.stort icri. () 

ABEQ F.P b,^^v-.2"' B JPAB: 1993f:bi22 

A signal pi 0'::';ss in^j metno^i f^tr r./t -i- rmi n i r.g t:;e r:as- se:{uence of nucb^ic 
acids by subjectinc digital sigr::.ls ^Pxr^y) t .: cgnal }:.r dC'-ss ing, said 
digital signals {[t-:3y) corresp'tn li n g t't an a i:t':- r ad : t ;jr.;ipn cf plural 
rest-lve-:i i .twa wni':;n are f trio.;d r-y res-:dvin':r j n r xt : vf a se -spec i f ii': O'NA 
fragments of h:ase-speci f ic PtiA fra-::ments in .-ne din.ensional .direct i'l-n on a 
support medium, to form bands ( r J , saia fra:}men^s r^e : ng labelled with a 
radioactive element, said metht^d cimjirising the steps of: (L; detecting at 
least twt' bancs fn) in the iiwer ^..^rt ol: ea :.:n i--solv-d rtw, and numbering 
all the rands (n) consecut i'/e 1 y friir the Icw^r er.d, .2) trtaining a 
correlatitn between the number f v/nich r-and (n; and tne cor respondinq 
migration distance (y') fir ea:h res:lvea r^tw, 2) d-. t-a rnJ n m g tne 
difference in the migration dist=n::e (y'; between thv rescdvei r<tws frcm 
the resulting correlations, and itaking ctrrectians f^.r the mdgrati'tn 
positions on each r^jv; by ta::ing said differences as the locationai 



''ie V i a 1 1 'J n o f t h e r- o v/ 3 f rc^iw e a ^ h ■ t h e r . 

;':;lgna.. {jro ■•-^ss r.-rthcnl t^'^r nuc.e:c acia 
:ipplit.^d t'j 'ii-gital sigr.al.^ 
I'-.'rmecl ijy I'esolviri'j a mix* . 
with raa.i 'I' .i'::t 1 e i -^ine:; :i. 

.!e.::ot tw-j :>aridc 1:: 
d'-t.--' ::te^i ar.d r.unih'er ed fr^M-i 
r.::d res-:'! -/'Va 
re:?'"' L ved ^ii ;::':ari 

1 ijeterirdri'ird <i:a:: 'M'Tre'::*: .1 1'-n i.? r,/i'i*:i f<ji the r'trsalvea fO'-.;- i t i 
i.y rai:i:;g ^.w^i d.tterence ..1;: i c-: 3.\ C'fi ijev.i a^: i on '"d *:ri- i'< 'V;.: f 
* h^- r . 

/advantage - M-rnod ai:: '^rii.^*: ■ ar.Li hicrdy a. ■^:-a ya^ ^ • . 



.l^^e -;equ^ri(-e di.rterrnn. is 
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of h.jse specdfic OriA -r RtJA fiaam-nts labelled 
one aiinen;: L-jr.a 1 dire-_:"fica 0:1 u .::upijcar: riit-di.uin. 
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;:etwe*:0". ' h^:- re/': !-ved 1 C'V;.: tr'jrri tne re.-aiting cc rrel at .i-cn 
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Ei: EE EE ET Ed LV I-IA ME M'- ME M>] m lA]-: ME dd dZ EE FT EG r.d EL' EE 
EI SK ZL TJ TM EE TT TE : lA VJG ES UZ VH Yd EA EW 
W'o EOalOLdEbr' AE Vj'j ECOE-nElE'dE E'":00id:EE 
EE :E'.'''.^-14 1^.E1 l^^E'd::EEo 
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V:0 EEiilOlEie A dE'AE: E0Eb:^^0 7 

E^ 1 1 V E L T Y - M e t h O'i s , cc-mp a t r p r -o g r .an is o^ n 01 d o:^ t Oi 1: ■ a e s f ^o r . i n a 1 z i n q ana ma k e 
u s e o t e n e h a p i -o t y (Hid i n f rn a t i o n , e . a . to d e t e r m i n to r - e f r& q u ^- n c y o f 
EEo in a por^u la : i ot. , to- find co^rr e b:it i 'On.? t'O-^iwe^'n HTo o^r -^enotyp'O and a 
o-inicai oacconie ana :o predL::t HTa fr-om an (jen^jtyioe fo-r a aene, ire new. 

U'ETAIEEE E-EEGPl?TboH - INE'EF'EdEEdT GLAIME ,ore inclaaeo: for the 
folbowma: 

(i) a meth'oa (MJ f-or 'Tene r i*. ing ^ FIT o;ataba3e f'Or .1 p-o^oulat ion, 
O'Omprioina data -]-rTi-n-o r'-to re sea t a t L v- od HT5 f'or a^ least 1 lonous 

froit. the inolivioluals (in-ii-/;:) in the olata k-a.^e; 

(P) a (M) 'id preoli'Ot.in? : ne pres-'noe od a ET r^air :n an inaiv; 
(/} a ill] for i lien: i f yin ;j a :n:; r re 1 fit 1 on t'C w-i-^-n a ET p i r ar.i a 
■:d Lrd.'; ;j I resfjcaiiiO t O' ^ t r*-eir mer.t , or other pnen^jtyije Pt . ; 

Hntirying a ooo ivr-o I at i o-n t^etwr-en a iiT pair arid 
:)n':iid.:0'n o^r a dLaeaje cd inter^eat (GI), or -oth-oi Pt . 
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3 1 r a c t u r e 



;4) a (M) fo.r i i 
saso'ept Ltoi L i t y to a c 
G )I) / 

■■'•] a (M) od' predi at i.r.a an mdiva responae t: 
p h a r ma : e \i t i 0 1 1 ta • e a t me r. : ; 

;e) -ii O'Omp'Uter impilenente i -Z-J ) d-I) for gen- 
a c r • ■ e r; 1: :> r lis p- 1 a y 'O r. a 1. f) i a y 1 e v 1 0 e ; ED ) ; 

■; / ) a dT; f'Or ■:jen-i-at i-n a HT pair- frequency acioot'Ti f«or display 

-■n a m; 

■ ■: ) a 1-1 dT; for gen-rating a Link age soreen for :li splay on a ED; 

■ ,0 a G-I (of: for genarating a phylogenetic tree s "ri^een fo-r 
display 'On a LiTo* 

{10; a G-I d4) for generating a genotype (Gt.j analysis screen for 
display on a DLi; 

(11) a (M) 0)f disp)laying clinical response ooalues of a subj ect 



popuiati'jn as a lunctii'm C'l HT paifs ot the indivs in the pcpuiar icTi; 

(i::} a C-l {fij t'-j: Cu rry 1 r.r| out. a g':^ne t i. algorithm lor findir.q an 
■aptirnal -et of v;e.qht-:- to fit a turi\-'tiori ot po 1 yino^rph 1 c site data to a 
clinical reap'OruTe me a ir-arernent ; 

(IM a t'-l (tlJ displaying cevfie 1 a t i ons b^rtween oliriioal oatco-me 

vaiues tor a oe..e '*"ed peii^u la 1 1 - ; 

(1-1) a (M- f'j.r- O'^; :dact i ng a 'JIioIojI tric;.L ot .i t r eritarieai t pr-'^oocl 
frjr a m^_-.-iicaI ■ojno i t L'^.o ('jI); 

(1^) a ([!:■ or iaf-rring Gts. of indiv subjects for a se:ltroteu geue 
having p't lyme'i r:lo is sin^:-3; 

(I'-d a [l-]} r^i de"^'^^dning p'0 1 ym': rpni sites cr sub-HTs that rc^rrelate 
uiti": a '.d in i i '-.ip.'O.- r cj': ooj^.e [01-; 

(Id a {M * I: d'Oi-rirdning pcd yrri': ix^ni .rites r sub-HTs that ii^rrelate 
with a ulini'i-.il r-:sptnse ir C'Utc"'me (CI'; 

(Id a ':-'>n.}->u t:'r ...al:de ('t-d; meoium (M<:i.) haviua Cijinpij t e r reofarjod- 
(C-h; pM^grarr, ou:*:- ( rd' ;E:o!:e':i uroii it, for oausing a C'lrnp-iter (d'^nip.; ^o 
adjust ■JJ:^^erv-u i-d pod: f r equenc i e.:: wiinin a population grouo: (tne HT pair 
f requen--^ i es are ;:-t:oreu in a C-P datah^ase of HT in f -j nmat ioTi f-jr a gene r 
jene- f e.a t are : '^d ■ } ; 

{ 1 .d a d-'l M';i. naving C-H Pd st^.ied upc n it, f"':'r causing HT pair 
assignments t'"- t:-.- ma z.o an iriOiv m-mb'Or ot a po}:'':.l a t it^n whi'Se Gt . 
inf 'i-rma- i on f r ..i gen*:- fe-ature (Ld oa st-ji-E d in a t'-F, te'rm; 

(.ru) a i"'-t' f^i'i. r. 3'.^ing d-H pi" st-Mod up": n it, f^:r- c.;iusin^ a C-i'iniJ. o- 
ider.tiry a r r ^ i it i-m o^^weer: a o^irdoil r-isponse : C' a treatment or i::ncr 
ir't . and a HT '.-r id' pair pr*::;Eent it a 'tand i.u.:^ t e Icj'I-u.: asst'Ciuted wi^h 
:lini^ai i: esiu:-]". - ^ '.r .-'her Pt . ; 

(Oi) a d--; dd. nr/in^ d-? st MtEd apu n it, f'.u- u.iiusin; a dC'n.p . : 
identity a r - dit ic'n between an in':iivd3 susceptibility tc :; .rendition cr 
(01) , y ,.:'dier' Pt . , ami a HT :r HT o^^ir pr'r?:nt ■oz a :-a no :.':{a " e 

jth^rr P' . f'tl ; ; 

C. ) a t'-'-' Hi. n^v-in^ u-r Pi* ;:t:reu ucu n in, t :o- oac.sin? a damp, : 
pre<:iiot an in-'iiv;:. lesji-'Tise zo a :E.eui-:;al or jiTiarmareut 1 cal tr-E-atment ba^ed 
on 'one ■: r mc-re s-ie::ted HTs ■: r HT rairs of the indiv; 

(^.■•d a 'i-d Md. naving d-P p; st':vred up : n it, ftr causing a dt-nipu t't 
display a gene's structure and. g-ne features on a display device lTu* 

{!:-]} a f-F M-i. naving C-P P.? ;:t'ired up-t n it, f ■:: r causin-; a 'd-cnp- . to 
disp'lay -tn a PC', HT frequency oata within a popuiat.on of iniivs, f(tr a 
select -.i gene cr ;ene feature; 

(i't) a d-F M.l. na'/ing d-F Pt st-: re^i up :-n it, fir causin-; a din.p. t ■:■ 
disp'lay 'On a I'D, p-o i ymo-rpTiic site linkage data for a gene c^r gene C(jI); 

(j^d a i'-P Ml. i:avin^ d-P Pd st^trei up-in it, for tausin; a dcn:p'. : ■:■ 
dispday I'n a ET a phylogenetic tree; 

{IM a d-P M^i. havings d-F'. F''V st^^red up-: n it, fir causin.; a dc-n:p' . ti 
uisp'iay a dt . analysi s screen :n a DC'; 

(id;) a '"-'.i M':i. navir.^ d-F. F'd ;.-t'ired up'in it, f'lr- causin j a dc-nip' . ti 
iisp'lay ■::linical response values, -ir itner E't . data, of a suojett 
pcp'Ula:ii-n as a li..ncti'.in 'if Hi' p^^irs of tne inuivs m tne p'Opu la*: i 'On ; 

(P'd a C-': f'li. having d-F E-? stored up'in it, f:r causinj a dc-mp' . t !■ 
display -in a FT-, :dini':3l respi^nce values, or C'ther Pt , iata, of a surg-E-ct 
pop'Ula^. xi-n as a lunctic'n of HT p^aiis i' f the indivs in the pMipulaticx. lor a 
;en-- or ;:ene feature f^'l); 

(o"i) a C-r- M'-i. n.jivinu d-F e: stored uoeti it, f ::ausin:f a damp, t :* 
carry ■■.■ut a ■iT-^neti i' al'.iorithni fiu" fin:ling ari op^timal set id' '.veightES t E' f it 
a f -^n::t.ii'n jf p-E) 1 ymorphic site uatE; tir a ET-i-ne or g-E-ne feature ( il ) t e- a 
cli.ni-.:oi resp'E'riSe me as aremer.t ; 

( ':d ) a i'-d M-i. navinET d-P E'd st-E'rei upin it, f ■: r~ causir.^ a r-E-mp'. tEi 
display o>n a Edi, -e: :■ r re 1 at ion retwe-Ti ilinicEil outcon.e; values obtairiod fr"om 
:=electe--i clir.icaL i^utume measutes f-E^r a selected p'jpu lat i on ; 

; .'dd a t'-d Mi. ha'/in:; i'-F. to": Et':reEi uoeo; it, f:'r causir:^ a id'mc; . tE' 
P'rC'ViEie i n f EUEma 1 1 E)n of use in -EUCid .:ct in 5 clinical trials Eif a treatnient 
P'rcitocEii fou^ a me-:iicai conditior. (dl } ; 

d^J) a d-d Fhi. having :>F. E' : i-tErcEi upE:n it, f:.r causing a tE^n^p. t e- 
infer dts. -Ed'^ indiv sunjeits ior a s-letted gene riaving p-Eilymorpnic sites; 

(14) C-U moEiia having d-P PC stE^red upm it, for causing a Camp), to 
Eietermine palymorphic sites or sub-HTs that correlate with a clinical 



resf:oris-- or Dutcome (f.-i), jt jther Pt. 

C':) ::cmps. pr-'T gramme 3 t: carry out the above (Ms) cr co-iprismg the 
aoove ':^jmp . -useable or -rea'Jacle meciia, ojnpri5:..ng a memjry v/ith at least 
I regoir. for storir-j o :>mp . executable PCs and a pTocess'jv ro:: executing 
the r ' 1 s t J r e ■ i i r. t h • :n m r y ; 

■ a data otr-^:ture tov st:)ring ar. organizing bi'ilKfioal 

inf-jrr'i::*:ion, stcue-:i m a C-R :-:d . and acce.s ^; i b ] f^' h^y a f^r-j -escciir, which 
o~3mpri5es a singJe parent *:;irde wnion is ddapte:i t:r st'iuinLj, organizing 
and r '.'t r L^-;ving a ni:fi-er :f :r:neti:: feature:: by the rela*:.ve p'osj.tional 
r-lat :■: nsnipa betv;e'-n r h-r ^•■■n^tij featur^-s; 

'.yn a {M} for -tMing ar.i org.fini -in :f biological information; and 
h-^i ^ iata ctru^-uie ::r s:iri.ng an : igan i. z :i ; .g bir_a..oal 
inf-.:r:njt o.:n, stored :in a -Z-'t Md . jni a ::oe.o: ii: 1 ^ cy a pro;-^s:;cr, v;hich 
:;ratricec a least 2 ■:iirferent fiebic, on^ :-f v/hioh incluo-:-a a number of 
g^-net i feat jres, or; a tne other id which :ncjuc<:-:: lebati'-e r :si tional 
r--iat ninchip.s betw--n ^ he jen-^i: features. 

d'Oe: Eurtnei oetallc of th-r aoove are ^iven .n the sp e ri f 1 ca t i in bu 
nad to be orrdtted frc.m this at^st r a-:.:t due tc- insufficient ot^ace. 

fcFE - Tlie metloo'ds, cionputer jrri.^ranLS ano <:iata;:ases b;r analyzina and 
nake use ...f gene hap.^otyF:'e HT ^nf i.rrtation, e . . t<i oetermme the frequenc 
'-.■i HTs :n a population, t !■ tmi :i.r r elat ions b^etwe-^n HTs cr genc-.types and 
f: ciir.i.cal outcome (or tnt- e::fe'-:ts a t hc'i a}: eut i-:^ intervention and/cr to 
f r^idict HTs frorn ar: ^nuvi -iua^ ' s ci:i:ty|:e icr a gtnc. 
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FA AB 

MC t-pi: B:)-^-£01; Ell-COyE:; Bll-CtJ; ElJ-KC-lAl; bC)5-HC_; D05-11C';^; D05-H12; 
roi'-iilB; D05- J 
FP'I : ;o)l,-B14H; TO 1 - Ji") h/, 1 ; T01-S03 

L127 AiiswFr. ^: r wp:>; tv)ivT.;T;';i; ; joo; pB'-weijt iiJF( hmaticij ttij 
An tocjCi-.;;!: tG''2 [03] wpii; 

BUN I]l;000-'.Jl9';'43 FdJC CT()(')0-0'J6875 

TI Transitior:/transversion ratio estimation luethod - for purine, pyridine in 

BNA involves creating a taxoncTiic tree from mout tase sequence data. 
CC BO 4 Dlb TOl 

PA fSEIB-N) SEIBUTSU BUNJHI KOGAPCd PvTllPvTUSHO KK 
CYC 1 

Bl C? li:S'^'"o:0 A 1^*990924 (SOOuQ?)^ JA l8p G0tE017-T0 <-- 

ADT JP IITI.^TTO A JP 1998-307740 1 '"r^ 9:!^ Ci 92 5 

PFAI JP 1998-::5313 1 9980112 

IC ICM G06F017-30 
ICS C12M015-00 

AB JP 11259530 A UPAB: 2 00i")0118 

IJOVEhTV - The base sutiSti tution nunber assessO'il frc-n type ol: 

t ransition-transversion is to .:::reate a taxonomic tiee. The h^ranch lengths 

of taxonorr.ic tree are supposes. The r^resurred type ■ii 

t ransit ion/transversion rati'i rs apjflied and assess-.'d value of: 

transit! on/transversion ratio is output. The ijase suristi t ut i on numk^er is 

assessed using a predefined equari-n tha: rebates to corrp-ared sequence 

number . 

BETAILED DES'IF.I F TI'-U - Th^r tase sequ-rnce data with mare than three 
sequences are input to data prc'cessi^r. 

An I::DEPEMF)ENT ■TLAIM is' also ihcludeo for crer^ting a 
phylogenetic tr^ee fri^m the actuai case sequence -iata. F'.eii'rateii 
calc;^lc:it ions are done using tno maximum evil at ion oistance and tne result; 
are converged oy a specified ir.etlicd, to ah'tain a ri gnt tree form.. 

U2E - For estimation cf t i ansi t i on / ti ana vers i m ratio if compinents 
such as f'Urine, pyridine in .:ie . xyric-c^nucle ic acid - jNA) using their :oase 
sequence data e.g. of mitochondria if human and chimpanzee (claimea). 

AF'VANTAGE - The transit ion/ transversi an ratio in controlled dcmains 
such as human of mi t ochi ndr ial DNA is assessed ciiriectly and the taxonomic 
tree is alsa built ^irrectly. 
F'wg. 0/6 
FS CPI EPI 
FA AB 



CPI: B04-E0i; B06-D09; B07"Ui:j; B11-C08E; B12-K(j4K; D05-H09 
EFI: TJl-JQSB 
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FILE 'MEFLIlJE' ENTERED AT 14:57:21 ON 1 : : FEB 2001 

f^ILE LA2T NPPATED: 2^ 02T 2C00 :2r.:X)102-/[;F) . FILE COVERS 1:^G0 TO DATE. 

MEDLINE rh'i^sauri in the /CN, /CT, und /'MN fields incorporate thie 
2:eOH 2^;; ■) v- .-ah^ular y , Entfix HELL IHEO/OIRUO ror oetaais. 

The 2LDMEDLINE file £-egnnent now cc-r.tain^: data frcim 12:8 through 1265. 
Enter HELP 7.01nTENT details. 

Left, i:-jh^., and sirrjl t ane<..ijs leit ana ri.;^Ft trunoati':n are a\'ailal:de in the 
Basi: Index. See HELP SEIELEiS foL details, 

THIS FILE C'jNTAINS CAS REGISTRY N^OMBEF.S F(jR EASY AND A'Y^ORATF 
iUJBSTAN'y-E I DENT I FICAT I C-N . 

MEDLiliE VPL'AIES ARE 'jN HijLD UNTIL AETEF THE ANNEAL ?.EL2^AD HAS EEEN 
OOMPLETEri. N'"iTI':^E WILL BE GIVEN t'NCE THE RELC-AF IS CCXPLETEr' AND 
.■.ELOAL' r.ETA-ILS VvIIiL BE FImINL' IN [;ELF' PI.CAxL'. 
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L257 ANSV:ER 2 2E ^3 I-^ElLINE 

AN 2':^'-^^--";8 24 -i'^ >;edline 

ON 2 '0F 24 2G 

TI Br.O_:.-lir^ dC'mai.n proteins: update an<:i new insights ^nto a rapidly 

e:-:panaing family of proteins. 
AO Henry C; Mather I H; McDerraott M E; Pontarooti P 

CO Centre National de la Feoherohe Scientifique URA 14B5, Fac:ulte de 

Medicine, Lirnoges, France. 
S(2 MOLECULAR BIOLOGY AND EVOLUTION, (1998 Dec) 15 (12) 2696-705. 

J'-jurnal oode : M'2B. ISSN: 0737-4038. 
CY United States 

DT Journal; Article; (JOURNAL ARTICLE) 

LA English 

FS PriC'rity t'ournals 

EM 129903 

EW 12990304 

AB The B30.2 d'jrr.ain is a conserved region of around 1^2 andno acids 
assc-riated with several different protein domains, including the 
irrirr.unog] ot uJ in folds of butyr-jr-hi 1 in and the RING finger ■domain of ret 
finger pi 'itein . We recently rer-.jrted several novel irieniters :'f this family 
as well as p'revic-as ly undescrit^-d rirotein families cvjssess ing the B30.2 
d'jmam. Many proteins have sucis-r-quent ly been founa t possess tnis domain, 
irioludin^-:! pyr in/marenos t rin and the midline I (MlCd ) protein. Mutations in 
tne 230. J domain of F^y r i n /ma ren-: s t r in are impli(-ated in f^^milial 
M..-.:iirerrar.eari fever, and partial loss Df the B3(i.2 domain of MIDI is 
respcTisit le for 'jpitz G/3BB synorjme, cnar.ic ten zed ;:.y developmental 
m^jl^ne uefects. In tnis study, we scrutinized the avaiiarle sequence data 
b.-.-^ses fc.r the i dent i f i o^3 tion ot novel B30.2 dimain prc'teins using highly 
sensitive dat ao'ase-searohing t: <j1s. In addLti-:-n, we discuss the 
chromosoir.al localization of genes in the BsO.i family, since the encoded 
proteins are li}:eiy to L^e inv-t^ved in other- forms of ^'eriidic fever, 
autcinimurie , and genetic diseas-zs . 

CT Check Tags: Animal; Human; Suc-tort, Non-U. S. Gov't 
.^inir.D Acid Sequence 
Autoimmune ['iseases: GE, genetics 
Cat:le 

ChijmosC'me Mapping 



Databases 
♦Evolution, Molecular 

^Hei^editary Di::ea;ses: GE, genetics 

M«:Je:uldr Sequence Data 
■* Mai C igene Fami 1 y 
-PnyLc^geny 

*Proteins: CH , chemistry 
♦Proteins: GE , genetics 

Sequence Alignment 

Sequence Homology, Amino Acid 

Zir.c Fingers 
C'N -J ( E'r-i'teir.;: ) 

LI 5 7 AIJoWEF. 2 OE . :' MElJLirJS 

DN ':^4 0'}^ 

TI Ti.r^:-e ri<jve.I f'.rr.-e i ns (.>f the syntaxin ''SNAP-^B family. 

AU oteegniaier M; Vang B; Yoc^ J S; Huang 3; Shen M; Yu 3; Luo Y; Scheller R H 
CS :'--'f:'ar^ment oi ricle':ular and Cellular- Pnyai^'logy, H'jward Hughes Medical 
Tn;:-t it ut e , otanlcjid University Si^hooj. of Medi^:::ine, otantc-rd, Calif'jrnia 

■:'4 ; V^-5:^4 t, U^-h. 

;;0 J'l'JPNAL C'E r IfJhfji-; h:AL CHEMISTr.Y, (1998 Dec 18) 273 ( ^- 1) /417;-^. 
CY i.'r-iited .;>at:es 

jT 'J .-ar Ti.ii a. ; Ai'ticie; ( LiC'MRrJ/H. AF.TTt'LE) 
LA Enghiah 

FS hr.i-:-rity JC'arnals; Cancei' Joarn.iiis 

OS I'^ENEAJJK-AEl 1 : ; CE:-;hANP'-AE 1 h' 1 : 5 ; ' -iEMBANE-AE hi. 5 16 
EM T.^'"^^^0:: 

AB ! at r.j; :e hlu.h5,r rv^n^j l a:\*-^ tiaffi<: ic th-jught :g be reguiateo m part by 

.:.-..duLde M--t re: hriiti i:ie-sen.5 : ^ ive f : :.ct or-at L .i-^himent prcrein receptc.rs 
ih2:Ar.Es' tnr'.'-^(::h z^'.e L 'jriri..! t ic n ot ctjriplexes bervioeri tneae prc^teins present 

vesi'.::ie and t.:irger meirbranes. All known ElNAr.E-iiiediated fuaien events 
iriV'ilve meiTX'ers ef th^ syritaxin and ';e3i':::le-aasijciated membrane prc'tein 
fartiiiies. The diver sity ot mait'j\alian memk'rane -i^Limpjartment s predicts th^E- 
existence ef a large numtier ot 'ilifferent synt-ixin and vesicle-assO'i'i ated 
merrihran- f-r C'tem ■_:ie:ies . T^o furtner iiivestigate the sp^ectrum <tf SNAP.Es and 
their r':des m mertih'rarie t. raff i<_- king we characterized three nC'vel meniber-s 

the syntaxir: an:i SHAP-.'S ( s;^T:aptC'S':'me-asscMji a ted f^rotein c^f 2t KEa ) 
sut'f ami 1 ies . The piroteins are brc^adly expires se-:::, suggesting a general role 
in \^esicle t raf f i'rKin'"4, and l^i'^-alize to (:iistin::t membrane compartmer.ts . 
Syntaxin 8 co-iocaiizes with m.arkers ot the endoplasmic reticulum. 
Syntax in 17, a divergent nemk'er i: f the syntaxin family, fiartially ^jverlaps 
w^th endoplasmic reticulum markers, /ind SNAP-2'^ is broadly localized on 
maltitde miembranes . SIIAP-29 d^^^es not ::ontain ti p'redicted mem.brane anche-r 
characteristic ot Lather EUAREs . In vitro studies established that 3NAP-29 
is rsit-able of binding to a brc^a-ii ran':]e of syntaxins. 
ET Cneck Ti^s: Animal; Human; Support, Mon-U.S. Gc>v ' t 
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TI .:)'^ "nieric-^ (X'n.par iscri^ usinq rault iplf^ sequences detect three times as manv 

remc'te h^jiTi-el ogues as pairwise niethcK:is. 
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AB Tne sequences of relure^:! pr'i.r.eins c-;-:n diver-::^ b^eyspod the f'oint where their 
relationship can be recognised by pairv/ise se:^uence comp-arisons . In 
attempts to C'vercome this I imi c.^t ic:n , methotis have oeen developed that use 
as a query, r.t^t a single se^uen::*:, i::.ut sets tf relat*;:d sequences tr a 
r-i: r'-^-sent ati tn of" the char E;cce r i ct i ss sh^^reci oy rel-ted sequences. Here vve 
':i';isr:.r i oe an ussessmer.t of r ri^^ee oz tnese ni'^-tnjds: t rie SAM-T98 
..j'u lement ati':jn C'.f" a hidaen Markjv nc de 1 p'rt-?-:' ::ure ; E^SI-PtASlv ana th-:' 
intermediate sequence search (IPS; prC'Cedure. VJe determined tne extent to 
v;hrch these pirccedures can detect e vcdut it.n.u r-y re la t i Dnship^s r'etween the 
members of the sequence database Pti&P40-J. Tnis dataL-ase, deriveo from the 
structural c.l assif icarifjn t'f pr ot eins (SC'jP), contains tne sequences of 
proteins of P.nown stiucture v/hc'se sequence identities with earn ether are 
4 '.)'■: or less. The evolutionary relationship's tnat exist between those that 
have low sequence identities were fcund h^y the examination C'f their 
structural details and, in many cases, their functit>nal features. Eor nine 
false positive predictions out tf a p'Ossible 4 32 ,680, i.e. at a false 
positive rate of about 1/50,000, SAM-T98 found 35^ of the true homologous 
relationships in PE)BD40-J, whilst PSI-3LAST fecund 30": and ISS found 251. 
(')verall, this is about twice the number of POEE'4 0-J relations that can be 
detected by the pairwise C'tmpiarison proceoures FASTA (17-6) ami GAP-BLAST 
(15s). For distantly related sequences in PE'3D40-J, those pairs whose 
sequence ider.tity is less than :0'b, SAM-T98 and PSI-BLAST detect three 
tiiTies the number of relat ic^nship s fc-und by tne p-airwise methods. C'tp^yright 
Academi't Press. 
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P.P P. r :j i t: i e .5 en*, e' ; t ■: o-.:--3r s i or. : 1 1 .:.'ari^:e oi. rdinq i r e 

.: i lui 1 .^i' 1 : i e.? w.i.thir: :ojoer i rr^r^- o*^^-:i [j.roteio t hree-dirierioi c.nal ;3P) SL.ru::our. 
.^r.a ^ppd i-d Lo ci]t;i.iar ^ r u ^^r ures 20 the ?r<joeiri Pata Ear;k. Por 

.1 iiui PiiT i ^ i h>e': we*:T. :P s"r^;-rureo .lacking o :. gn i f P car.t oequeric-i 
.i iiu : PjT i t y , the ^mr^C'r r nt di^tinct.ion was m.ide ijotween .remote noniology ic 
ar,':oerit o-Mrancn anc-otcr} and analc-gy (.likely convergence to a folding 
ir.': titj arrcicmg tC' the structural L:la::.o i float .1 -on of prote.ins (SCOP) 
d.ut aP-.^se. Pur^ersi t*.: s were defined as s+: ru(::t oral .LC'Cati'jns on q.roaps of 
an.n I'jgO'Us p^r'-.teins (i.e. ^up'Oi f 'j.Lds ) sh-jwing a at at is t i cal 1 y significant 
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ten(":er;<:y tt- rdna sot s 1 1 ites oesO'ite little 
for ^he r^r-itc-lnc ct-no 1 '::;e red . Vve Pdentiiy 
O'. n t i:ln: n^.: sup'Or s i t c 0 : f e r reuc'>; i n- 1 .i ke folds, f r - hel i ca.l bundles and 
'j'l-uP ] s: r ansied be^-.a neli:-es. In adait.. s>n, one m»:-:hod qiiantifies binding 
oit>_ .oi::d i T r i ^ lea within i.oit.- IC'OOUS prcte.i.ns and previously' identified 
^'.:r;'-rsi^ es sucn as that f'Ouno in th*E- b^vta/alpha (Tlrn) p)arrels. Por the 
nlr.e sup'O r f cd ds ^ the --rruraoy c: |.^r':■di t i one of ijind.ing sloe lo::ationc is 
asovsaed. Tiucd b.O:i : 1 c-ns for fcrc.tem evcviuticn, anri -t.p r»redlotion of 
j'r'j^'Mn fun<:-:.iL>n either- tnr':i.:gh f<jld i :oqn l ^oi -on -*r: ^.er-flary stru(;:ture 
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1 n t e r f ei'on indue e'i |jr:>teiris, pr e.sumably pr*-:^ s t-n t rdir (jughout th(; marunia .1. iaii 
^:'l.iS.::. Using the Sxp^ressed Sequence Taci aa L ar^a ses , v;e h.^ve identified a 
n'T'vei memt-er of thi:? fan.iiy. This prc-t^-in, wh.ieh we nanuai p59 2 '-5' 
o 1 ig^^adeny Late synt h*:?t: ase- L i ke protein {pb[^Ok'JL), shiure;:- a higfily 

unserved I\-term.nal. ocir.ain v;ith th*:? knov;r. terms of r) 1 i gciadeny 1 a t e 

::' yn" net ase.5 , i:-ut dilfers O'jnipletely in it::' C- 1 e'rininai port. Tin-- t'-terminu 
■jt \ -i'^f'^AcL is f 'timed f tw-j domains c^f uij i cju i t i n- 1 i sequer.ces. Here v;e 
re exeri" trie cnar act er i sa t i': n of a fail- length cDMA clone-, thiO (jenc-uiic 
.o-qu^--nc*^ ar.d ^:hie exjieres .:.i-j:". j^attern t-f this geite . VJ^ h^ve a^ldrest-ed the 
:d at iC'H C'f the .i^'-5' s 1 i ooaden yia t e syn*:hetase gene ft^mily, in the ligh 

s;."i*:a fioi'ir. C'tncr ;".jjer:i.es , '.vr.Lcn have ie!:'(-:n:iy aispe^ared m tht' dat,dl:>a.s 
dn-'-K i^igs: An lk;..: 1 ; Ft-maie; Human; Mere; oi:pr::o::: t , hon-tdS. t;ov't 
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Tl Eiar.'tt ion driven prc-tein evs-lutisn. A pC'Ssible prc^to-prc t ein fc*:: th*t 

r h'A-ii' inai rig p»roteins . 
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Ah v.'- :ritro :luce a hyp'-tthies i s rnat pre3er:t 'lay proteins evolved fr ^tm 

"pr t ^-oi-pro^teiriS , " tmiill 11-27 residue f:e^pti:;^;s witn s tnie eLemer^ts. of 
s^-c<-.'ndary structure ana primitive fur:cti.on. Increasingly stable an-'i 
functisir.ai proteins art^se r>y adiing structural elements tt) pr:)du-:os the 
small d'jmains or pjrotein msiules that v;-? v;ould r^scc^gnize tD-iay. Ero^m, this 
cs'int of view, tne sarprising s imii 1 a r it ies t'etv/een sniili struct ureal 
fragments 'jt large proteins, that ar-a usually taken as examples of 
c ai-r.'er gent , f uns:t i sn- :ir i ven evcdation, are interp'retea in exactly *:he 
:r)rc:site way--as t^ates of con:imon e v j lut ionar y origin. As an e:cample, a 
hyp'Othet ic al evo Lut it;nar y tree for tv/o famdlies 'Cf RNA binding proteins, 
the t)B fold, a family of all beta proteins, and F.Bj fsld, an alpha 'beta 
prO'tein family is presented. We argue that both prstein families could 



hdve evolved frcm the same RNA-bi n-:n ng proto-prot em, which had a rorm of 
bet 3-.Ioop-fcet a F'.NA binding rnotif. 
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Tl E'rotein clisor-'ier and the e\/olution of molecular recogniticn: ther^ry, 

p^redictions and observations. 
A J L'unker A K; Grrner E; Gjil^ict S; Romero ?; Albrecht K; Hart J; Obradovic 

Z; Kissin-uer S; Villalranca J E 
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AB Observations going back more than 20 years show that regions in proteins 
with disordered backbones can play roles m their binding to other 
molecules; typically, the disordered regions become ordered upon complex 
formation. Thought-experiments with Schulz Diagrams, which are defined 
herein, suggest that disorder- t'j-order transitions are reguired for 
natural selection to operate separately on affinity and specificity. 
Separation of affinity and specifi^-ity may k-e essential for fine-tuning 
the molecular interaction networks that comp'rise thie Living state. For^low 
affinity, high specificity interactions, our analysis suggests that 
natural selectic^n would parse tne amino acias conferid.ng flexibility in 
the unoound state from thc-se conferring specificity in the bound state. 
FC'r nigh affinity, J.ow specificity or for high affinity, miltiple 
specificity interactions, our analysi.s suggests tiiat the d is'jrder- t'j-order 
transitions enable alternative packing interact ioris t-etween z.ide chains to 
accommodate the different binding targets. Disorder- t o-order transit ic.ns 
uton binding als^i' nave significant kinetic impli.::at ic^ns as well, by having 
complex effects on kioth on- and off -rates. Current data are insuf f i<::ient 
to decide on these p^rop-osals, but sequence and structure analysis on two 
examples support further investigations of the role cf disorder-to-order 
transitions up-on binding. 
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TI L-etecting distant relatives cf mamLmalian LPS-r^inding and lipid transport 
frc'teins. 
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AB In mLairtnials, a family of fciur lipid binding proteins has been previously 
dotinel that includes tv/o lipop'Olysac -har ice binding proteins and two 
lit'id transfer r rote ins. The first member if tnis fam.ily to have its 
three-ilimensional struct ui e det ei^mined is oacter icidal /perneabil i t - 
inrre,-::sing piotein (3PI). Using both the sequence and structure of'^BPI, 
alcng with ie::ently dev^;Icp;ed sequence-sequence and sequence-structure 
similarity search methods, we have identified 13 distant members of the 
family in a aiverse set of eukaryotes, including rat, chicken, 
Caenorhabditis elegans, and Biomphalaria galbrata. Although the sequence 
similarity between these 13 new members and any of the 4 original members 
of the BPI family is well below the "twilight zone," their high 
sequence-structure compatibility with BPI indicates they are likely to 
share its fold. These findings Proaden the BPI family to include a member 
found m retina and brain, and suggest that a primitive member may have 
contained only one of the two similar domains of BPI. 
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; L'--d dc'ir.ain tarniiy, which in::: lodes the CI and C.2 
o]: faovoro V anci Vill, occu:r in a greao variety 
t euka.ry :■: i -:: prcteins, mC'St ^d" ohi^-h. nave been imp! ic:a t Oii in 
- hi -::id::es i ^:oi or ^:;ovo hi^riinent a 1 poC'OL-sses. 3o far, n:o t hre^ -(i irnens i c nal 
structure- a kri'Swn exampi':- of :d".is extracellular nioduie has been 
(;iet e r I'll neu, liirdt.ing the us ^-i f o In^rs s ct idenolfying .i:: r.ew sequeo'te as 
lu-n.ber ^'-f tnis friirily. Her-e, v-^ i 'resent results of a ro^'coit sc:arcn of she 

r'ttein sequen::e catartase t':*r nev; I).: d<jrnains using general Lted pr^: files, •:^ 
ser:Si* i\'0 :t;\. i t. i ' e a I i gnmHOi^: - ba = ci .ota.rch techno que. Several cuevic'Usly 
vu";. r O'S'-'gr: i 2->"a OS dtinai.ns could be- ij;it.uo.d :f 1 ed by this rrieth-cd, in^j'luding she- 
first en.iirpdos f.r:ir. r re ka ryt/t 1 'S .-; Cieoi e:t . Mere import antd y, v;e piresent 
s u^it ist ^ ca i , stiusturai, anu tunoti'jnal evi(denc:e that the Dl aomain of 
ga l.ict C'Se s-xidas- whose- three-diniensi onal structure has been determined at 
1 A7 A resolution; is a dist.-int memr-er of this family. Taken tc^gether, 
these findings significantly expand the concept of the 03 domain, t-y 
-.^tenuing its raX'Snc-mic range an^i by impilying a fold piredi.ct ic^n f'Cr all 
its memt'Ors. The frs-rosed a 1 i <"4ni^it-nt with the gala-::t'jse oxidase sequence 
makes it pc-ssible tc cc-nstruct h'^mol ogy-based three-dimensional m.S'dels fc^r 
the ms^st interesting exampiles, as illustrated by an accompanying r-aper on 
the CI an-i CI domains C'f fa^-tor '/ . 
CT ':^heck Tags: 3upf>:ut, Ns^n-tEC. elC'v't 
Pji I i n Ac id 3 e g u e n c e 
^Bacterial Proteins: CH, chemistry 
Bacterial Proteins: CL, classification 
Databases , Factual 
Evolution, Molecular 
*Fungal Proteins: CH, chemistry 
Fungal Proteins: CL, classification 
' Ca la'ttS'Se Oxidase: CH, chemistry 
Cala'ttc^se k)xidase: CE, cl ass i f i tat i c-n 
I-^'Cl e-;ul a r 3equenc-e Cia:a 
Pr<ct-iri Eolding 
PrL'teiri 3t ructur^s, Tertiary 
Sequence Alignment 
Sequence Homology, Amino Acid 
:N EC 1 . 1 . 3 . d f ■■■.;alast cse 'jxidase); J oii scL)idins ; ; 0 (Bacterial Proteins); J 
fpur.qal Proteins) 
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Phy] c> jenet i s relat i'Cns.nips -jf fungi, clantae, and animalia inferred frcmi 
honu':l gous comparison of rioosomal proteins, 
v'euthey A L; Bitt: 



CS Defjartement ie Biochimie Medicale, dniversite de Gen ^ eve, 3voi t ze r 1 an d . 



J'jlJFNAL OF MfJLFCULAR FyniJ^TlOU, (1998 Jul) 7 {!) ^^i-^/. 
Journal .rode: J76. ISSIJ: 'rVA—^H^M. 
CY Ur.it 3: ;jC-s 

OT J-urnal; Article; (jnuioJAL ARTLCLi^:) 
LA English 

Pr ic-rity ma Is 
^'A 1 ' ■ 

A^^ Th.-> o-rrifrl-rie set of: .i v.'i: lab.i.*: rLl:o)Soiaal t'lotein^ was utilized, at both the 
p':-} o .i.'.tLo iir.ri the nuc i ^"^ id i level, tij tabliso that plarit:^; and inetazc'aris 
f '1 tn. " v/i.:' odster <:ladv;? reLar. .ivo^ t^j fungi. Liilfeient phyl-jgene t i c inference 
nLeth<'..-is are at'CiIied ^ *:n^:' :?Hqu.":'r,.-f-^ d.^ta, ■_,.;:onc- ar -:r.^z ixr.s r^:- the L.utcroup. 
T:".^' ^vvcd a t 1 itnar y Lt;n'-.itn it f *:.he ir;ter-nal l:^ranch withi.n "he- ev; oar yjt i v.- crov;n 
tro:o. Trrry io dero^n;: r..^ t c d t C' be, uL iriost, ijr;e-:entr: oi ""he e volut itoiat y 
lentrth tf : :te t-ranch b ruoinu *:.o 'C'le cer:arc'=- 1 1 ct of th'-;te ti.i^e <in(j!".:oTTio . 
■ "] Cr.^C', T.i'::;:r : Aniitai; i.'t-ri^'.j ra 1 1 ^/e ttuay 
'Aniro:l.f: dh, cd <3 a a i : i < 'aO .oji'i 

Computer Simulation 
♦Evolution, Molecular 
* Fufj.ti : Ch, ol. a:r:-:":i fioU'. ii:'n 

Mt'Oels, Genetic 

ohy^oqeny 
' ? \ ar.f a : t^L, td.iotof : c<r. i r>n 
*Ribosomai Proteins : GE , genetics 

Sequence Alignment 

Sequence Homology, Amino Acid 

;:Vquerioe H'tmcd c ny , rtuole-ic Acid 
Ch ij ( r i bt'or.rr a 1 E'r C'teinc: 

AhhWFP 1 1 t'h o ? MZLhlbb 
AN b:^ ^ a _ t : Mb L 'L I ! "£ 

T I Tr.e ^"o-:t an^ i i ng be^. 4 - ua J ct t-a ;^/l 1 1 an:i f t-r ac e gent famij.y: ineaaages frtin the 
dot aL..zinka . 

A-; L:- N W; j:hap:er J ii; hevaner d; dhaper N L 

Ci* i:-/p.ar tment 'id: E'ha rma-o:doi'4y and Idtdecular Sctlencea, ?:enneoy Kriea^er 

Inatitute, Ba i t i H'O're , Mb, bbA. 
N': CA4 57'^:^ (MCI) 

■30 ChYC()Bb:JbCGY, (1998 May) 8 (5) 517-2 6. 

C-;urnai C'tde : BEL. ICCK: 0 9 1 9- ,5 ^5 . 
CY EtlGbAlIf': ^initec binq..t:rr: 
jT d-.urnai; Arti:-b?; ( J' ;CF,NAL AbTl CbE ) 
LA Engiiah 
F3 EiiC'Tity ■.Tc:urnals 

03 irE:iBAlJ?:-AbO:-:8oCC ; i:;EdBAMK-AFO -IH tel; i^ElIBAN?:- AFU 13 662 ; OENBAMK-AF0 38 6 6 3 ; 
GEdBAMK-AbO 8 8 6 64 

EM b'^;'r*8b:; 

AB Fr--'n' a 5-yot err^at i c ::a£-arch td" the r'nitiene and dbECT a<atal:anos, using human 
b-'-fy-j 4 -ga 1 a-:t':'ay 1 1 rar.s f ei- joe (be*-^ 4t-;alT-I}, which l3 i ecogrd zed to 
fun't^i'tn in lactoae t-i t-synthes la , as the guery ae(qa*;!nce, we have 
iderititdei five a idi r. i-jna i gene tamid.y members oenca.ed as beta 4GalT-ri, 
-III, -bv, -V, an 1 -VT , tb^mp I'lnnenta r y DNA tlones ccoitaining the comrlete 
:''j-.iin .1 re-iU'Dns for e^i^.L of the five nuirh.an hcimol'tga '^lere tltt <-i me 1 'tr 
:v;uierared by PCR-t-.-ised strategy (rlACE) se'quen'::ed . Feb;itive to beta 

4::tib:'-l, the per-cent s^rguence id^-ntiiy .at tne limine* atid level between the 
irilividual fsmily mem!:i^-;rs, ranu'^^^- f r jt) : (beta 40:.ilT-VI) to 8 : (;tets 
4 1; :i b!' - : I ) . Tne h i : h e 1 1 s -t q ti e n : e j je n t i t y hi e t vje e n a n y d f tne f 1 Din 0 1 1 g s i s 
b^twe--r. t^eta 4GalT-V and beta 4GaLT-VI (6 0.). t-eta 4GeilT-II is tne 
:.'rt r.t> . :.) q :)f the trhiC'ien bet.a 4GalT-II gerie, whi tn na? t'oen demons t r :i t e a to 
-i^-:o-ie an a Lpn i- bict i lo>umin reap'jnsive ;j^'ta 4-QS:la::tosylt i ansferase 
;3naper e^ al., J. &:.:d. Cnem. , .072 , 81 :0 ^-81898, 1 8 :W ) . As establishe:i by 
M-:>rthern analysis, beta 4 8alT-ri and - b/ show tne niost restricted pattern 
■:-d' ti-sue expressi.tn. Hign stei^iy stite bevels of k^etEi 40alT-Il rtONA are 
s-;.'eri Mrd.y in fetal brain and adult heart, mjscle, and pancreas; relatively 
.nigl-i levels of beta 4GalT-VI mRtJA are seen only m adult brain. Wnen the 
c: jrresponding m^use E3T cltne for each tf the beta 4GalT fatdly memk-ers 



was used a^^ ^hf^ hybi idi zat. icr: prcb- toi ;..;:ai n cir^a^ysis V)i murine 

I'lammary ti;:sue, t >'aris cr ipt i of c::ly the beta ^IGu:. 'i'-l qen-:- ::ouId be 
dete'jtt:-d ir. the iaetatiny rujirmiary qlar.d. 7'hese ebs^r- Vat ic-ri-i support the 
■ :onc iu;;-- 1-:^ t.hat ari-jn<g the :.ix knciwri b-;-ta -lijdLT family merr.b-^'is in the 
rn.irranaliari qencme, that hav^- h^een generated thr<')uqh multipd-:' qene 
d\:pLication events -j f an ar: ;-est. ra.i. qer.e(s), only th^v bet:a -''^-liT-I 
an-.-est i iin-aqe was recri:ite.:l tov L5:CtC'.:;e bi c-s ynt hie i s aaring the 

-L Jt i o:": o f" lu.nircn.n . s . 
bh-cr. Ta'js: Anirna^.; Human; buppc-rt, [v.^n-.b^. G<:'v't; b.-ppc^i':, U.G. Gr,v't 

H . b . 

A iu : t 

A:-ir.~ A:i':i 3-i;qu-n ::e 
♦Databases, Factual 
Evolution, Molecular 

r'-rtU 

! t'l b:iet oa;/ 1 1 rar.o le r isea : 3 : , h- i t .synt hes.L.= 
*'i bii.:i':-t 'I'.-yy I z raris te :; ^se.^ : Cn, rr.>z:iii. ;ti y 

[•l:lecu:ai be-gueno-' L'a*:a 
'd lult ia'-;-ne E\aifb iy 

F'^' Lyme r a se bha In t.eric^ion 

Recombinant Proteins: BI , biosynthesis 

Recombinant Proteins: CH, chemistry 

■•.iJA, H^;-:: ::~enqer : Mb, iriL t ab^ d i siii 

Sequence Alignment 

Sequence Homology, Amino Acid 

':'ran:r'M:ipt icT., ^b-^e-ie. 
bt' .b-3.1.- (bola--;t..:syItran.--fer.:i5es ; ■: ( r.ecoiro i nan^ rroteins;; 0 (RMA, 



AbbVJb? ii t'b .1 [^bbj],IKE 
i ( ) ':i :■ t, 

bhimeri'.T .-.t lue-ture the ]'.AV{?) ^- and NAbFt-depiendent rr.alic enzyme:--, ef 
f hi Z':'bium ( binc rhi :M'bi urn ) neli I t'ti. 

Ibts-h M :■; V:eq'tl-: b T; CL-wie A; Oaterac H; r'Lnan T K 

r.:ep'artment of Bitd. :qy, McMaster Uni versity, l^rC F-lain btre^;'t VJest, 

H..in.i ItCTi, (jnt .":ir io L::b 4K:1, bana^ia. 

.TftqrblAL t)F BbqLtntPAAb (tHEMISTFbb (1998 Apr 10) 27 3 (!:} 9::-0-6. 
J.^urnai e.^ae : HIV. IbSN: Oiibl-d2=b3. 
briited bt.^tes 

b. 'ur-ai; Article; bJCdJRNAL ARTb:LE; 
En g l 1 sh 

PM<:,rit j' :''turnaii'; bancer .T-iurriiis 
1 V* ■> 7 

[■tjli: en:::yrrie3 c^^talyze the ':.-xi':i "::t i\^e -uecarbC'Xy 1 at i en :f luabate to r.yruvate 
m ■.'■i^nj an :t i. tn with the i^'i.^cti:n f h n i o.-jt inami de c:far't':'r. We 
'-:».'t r^^.in■?:^i the C'b'A seqaerie-z- an^:! t r.-inscririt i t n.ii 1 st.iirt sitea of the uenes 
eno':-::] nq the dirih^'Srih^ipyr i dine nuc leot ide-dep-ndent m^lic -nzyme ( bME, EC 
1 . 1 . 1 . ri^M and tne t r i phosph-i-fiyr i dine nuc ietit i':ie-depenaent nialic enzyme 
brME, Eb 1.1.1. -bJ) F:h 1 zc'kauru { : ; in^jrhi Z'^ljianL) nieliloti. The predicted 

:blE and bl-lE prot-ma .-.^ntarn and 764 a:iiinM acids, resC'-ct i ve ly , and 

ar^-- ar'pi" 'nimat e 1 y bb> aniinj ac i ::3 iai aer tn^n ptrev i 'jus ly cr.a r acter i zed 
ra- t J ry :i^ 1 z m:ili-.: enzym^'S. ihe inzreaaed aize of. bbb ^nd TME reaidec in 
^ hv' e rcTu.n a 1 e:-:t; •^■r.a i.-: ric i'Zri sr-.? zirriib^r- in ::;e'q .^-E-n ze tZ' 

p[i->zr'h'Ztransazetyizze enzynes 'EC J . z . 1 . 3 ) . Mzdified jME -r.d THE pr^zteins 
v;hi..'.-h la'.'b this b-terminal r-eai-jn retain maliz ^;nzyz:e acticity but are 
^.n:ih'^e tt* jliqcnierize intz the native- state, bita r)ase sea.rcY.e.s have 
rev'v'iled vhat sizizlar chizuerdc ziali't enzyiiiez v;'E're ar.iquely present in 
-■^r iZ'.-r.eg-^it ive rz-:teri.a. Tnaa Z'>jb an^:: TME appeiir tC' be memr:iers of a new 
jl.iss of nializ enzzy:ne c-ha ract e r i ze d t-y the presenze of a 
r-.h .;z.pnotr insnzetyl-^ae-l ike aomai.n at tne b terminal; of the protein. 
Che--:k Ta:^5: ZoTip ar a t zve Study; Support, Nzn-b.S. Gov't 

Aminz' A::ii Sequence 

Bazteria: EN, enzyrrolc-qy 



Bacter i a : I'^E, ;jeneti ::3 
Chimeric Proteins: BI , biosynthesis 
Chimeric Proteins: CH , chemistry 
Chimeric Proteins: ME, metabolism 
♦Evolution, Molecular 

*Malate DehydrC' genase : '"H, cheirii -5 1 ry 
M^filate L'ehydr'j genase : GE, ■■j'^netics 

Malate n-ehydrc: ;^:nase : mk, me t abC' L i sm 

I^rilecular ^eqaerice L'ata 

:'hy Ica-ny 

Sequence Alignment 

Sequence Homology, Ammo Acid 

: nC'T hi zC't' ium :uelil::ti: EN, enzyir.cd oqy 
: r.'M r. ^ z u^L rn-e^i il ;: t :l : GE, aenez.ios 
Software 

GM E! " 1 . 1 . 1 . --T (MaL<:;te jehydrc-genase ) ; EG 1 . ] , 1 , 3 ^) (maiate dehydrogeria5^e 
(i^;:caib'jx-/lat i:i ) ; EG 1 . I . I . (inaiate dehydr:?qena:^a ( oxaio-icer ate- 
d^cart'Cxy lating) {NAjP+)); 0 {Ghimeric Protein.-:) 
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GI r.£-ena^iJ-aeqt=;rieT-a*:e liy/jria cd i gc nucleot ioe ra imers fcr amp 1 l t icat iC'n of 

■ji^LdViXly reLi^ed oe ju^^aooc . 
Au r.'-----*^- T Gonult:: E E; EordKC'if E Ti; ? lecic^KCo/o < i G; iM'::GaLium G M; 

Ht::.! KC' f G 

GS [''--partmonr E' aohc-td 'ji'i- 5;^,', GohC'Cl of PuEdic I'ErsLth and Gonununicy 

*'l-dicirio, Gnivor^^iiy of VJac]"iirjJlcr:, Seattle, VJA 981 ?»5, GGA. 

EG NGo'IuElL' A':i:'3 P.EGEAF.GH, (1998 Apr 1) 26 (7) i-;.A^-35. 
G'-. r r.a L o<:-o*i- : 2' BE . IGGi i: ■"■:■]:- 1 7 4 3 . 

GY EliiJEANC' : '/x it-id r'i r-.O'T^jin 

GT -.M r...i ; Ar^LoIe; ( A^GENAE AP.TIE'IE! 

EA Er-iliar: 

ES rrior i^y .7:0 ma I s; Gari';or .Toarnalo 
EM 1:^^827 
EW 1 ■^28E7 0- 

AP. v;^r dosorit^o a nov; p.ririL-r doaig:: otraragy tor VC?'. ampjli float ic^n of unkn<awn 
t:arget5 that are relatoo ■*:':■ ir.oltiEdy- aligned [oriteir. segaenoes. Each 
r-rinier '::or:5lsts of a sh'i'rt 3' degenerate 0':re region and a longei 5' 
/■ansen.rus olanit' regi-^n. Only 3-4 highly c-jnseived aiulnc' acid residues are 
ne-r^essary fc-r design ot the oc're, which is stah'ilized by the ciamp during 
anneahing to template rr.olecules. During iater r:unds of ampl i f icat ic^n , the 
n':oi-deg^"'nerate clan.f' r:'^.^rmits stat-ie -annealing t: p^roduot mcvi'E'Oules . We 
-ieirionstrat e the practical utility of this hylari'ii primer method by 
■:ietection ot diverse r^-;verse t ranscr iotase- 1 i '::e genes in a naman genome, 
■and j:>y i:iet ect ic^n of G^LGIA iriet hy 1 1 rans f erase h^jii'O' Ic^gs in various plant 
L'NAs . In each case, ami-:difiea prc^ducts were sufficiently pure to be cloned 
v/itn^aut gel f r act i':nat 1 jn . This E'Ginsensus-EEgen'^rate Hyt-rid 
■■71 i g'jr.U'-:le'.at ide- F'rlnier- ( G7)[:'EH2)? ) strategy nas r:"::en im^alemented as a 
-r.-amput er- prc^gram that is accessltde over the VJ.iula Wide Wefci 
{ ht tj:' :/ /bio rks . fhcrc . org/7:;odehop) . html ) an-i is Liirectly linked from the 
Bhii-akMarier" ritultipl-E' se^juence aligrnment site f-jr hybrid pirimer predi::tic)n 
C'O^Tinning with a set of rebated protein sequenc^:'S . 

GT Gheck T-'igs : Animal; Human; Support, N:)n-U.S. G:'V't; Support, U.S. Gov't, 
P.fi.G. 
A-uifL-j A 2 id Seguen'i'e 

Arthritis, Rheumatoid : 3E, genetics 

F^ase Geguerice 

G:.djn 

Computer Communication Networks 

G-ansensus So:5uenc-: 

Ganserved Sequence 
^GNA M>adif ication Methylases: GH, shemist ry 

GNA Modification Methylases: GE, genetics 
■^-DNA Primers 
*Evolution, Molecular 
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i yrrier aoe Cham F.eactioT.: MT, niethcM'Js 
*RNA-Directed DNA Polymerase: CH, chemistry 

RNA-Directed DNA Polymerase: GE , genetics 

SaiC'irna, Kap^-'Si: hi], genetics 
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SeoroLng an anci^-n:' enzyme in Met hanC'C'j :cu3 jannaschii uoing OFF, a program 

baoed on pre<;licted oeocnctary .structure comparisoTio , 
Aur.^ra R; Pc-se L' 

Defjartmo-nt oi of 'h-yaics ana Bicpiny;: i ca 1 Chemistry, Johns Hopkins SchC"ji 

of MeairLno, ''2'.- liorth W.:d f e Co.r-eo, Falr.imore, m:j 2120:, USA. 
PrC'CEELdlCCF 'dih: NAT.[!.'NAL ACAT'EMY o'r' FCIENCES OF I'HE 'JNITED STATES (JF 



AMERICA, (1998 Mar 17) 

J-.inal coMie: P'/ 3 . ICCN; 

J-:u:nal; Ar::-1-; (.oOdP.: 
Engl i ah 

F'ri.Lidry Jr.a: rials; Cane* 

] 0 :^ ■■* '-' F 
1 0 ■" F '- - 



( ^^d 2 H c - 
00^7-3 F: 4 , 



JL'Urna 1 s 



W-:- nave aevel-p-d a oin.fde r-roc* 
g-n:-]id'.: dat ai:o;i s^s . Tne ^:uogri;im, 
pr -i-oict ed ^^eo-jn'iary st.r U':'t ur e . 
than imin-o a^:-id S';-o^ueno-e , am:! s:r 
in exp'lodting thiy- tact to fond n 
0'rguerr:;e ii.a»iui*. it y . oF.F is a secc: 
S'ldel y on predi'rt i 'Ons fr^'m segue: 



jie zo identify protein h'omo].ogs in 
il.le':; OFF, is teased on cijmpiai'i sc-ns of" 
:tt:dn struiiture is far better cC'nserved 
o.cr ure-tiased met nods have been ^-ffective 
i-mcd'irs, even amonq prc'teins with scant 
nai"/ St ructure-t'rised methC'd that op^erates 
re and reguires no experimentally 
aeter-mined intormation ain^'Ut the structure. The approacn is illustrated by 
an example: Tnymidylate synthase, a highly conserved enzyme essential to 
thymiiiine rd os ynth- s i s in t-ith p'r'i'karyot es and eukaryotes, is thc'ught to 
be used oy ArThaea, h'Ut i co rresp'Onding gene nas yet to be islentified. 
Here, a candidate thyinidylate synthase is identified as a previously 
unassignea op-en r-aaing fram.e fr^tm th^r genome of Me than ococcus jannaschii, 
viz., MC27t7. Usir.g |;;.rini^ry struzture in f ■jrm.at icn alzne, tne optimally 
all^;ned seguer.ce .deritity lietween M.JLt7 57 £ind Escherizhia 'ZC'li tnymidylate 
synthase is 7 , wod 1 below the thiesholi of sensitivity for detection by 
S'-guence -ijase'ii metri<:'as . 

i2h-<:Fr: T^igs: C^'->mpia native I'tuuy; Support, 70 S . Gov't, P.H.S. 

/Vrimo' Aci'-i Sequence 

Archaeal Proteins: CH, chemistry 

Ba^^teria: EN, '-nz ymodovT y 

Evolution, Molecular 
^Methano ::occus : EN, -onzymoltqy 

M.:.ciels, M:de':u.ar 

MolezuLar Segu-n--:e Tiata 
^Protein Structure, Secjndary 

Protein S t ru ::t iir-:-! , Tert iary 

Sac zha romyces ■■:e re vi s lae : EN, enzymzlogy 
* Sequence Alignment: MT , methods 

Sequence Homology, Amino Acid 
* Software 

' Thymidylate .-iynthase : CH, ::hemistry 

EC 2.1.1.4t C^^hymidylate Synthase); 0 (Archaeal Proteins) 
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An :::ppr' ^a^:h to oetecti-'Ti of pro^oin t root 'jroj i uiotits usirr'] rj^n encodinq 
.:.';:h6'iT.e -'.'f ba;^kt'OTie oon I orma t i ':r..s . 
M,':itsucia H; Tani-pjchi F; H.ashi ir^i't o A 

[top5ti tnivrnr. -j f I r: t <jrniat : cs ana Hat herna t ioa I S';:ionc<:^ f^adua"^^ f^ohool of 
Er;c i.rjoe i.- irrg 3::ionoe, ij. a^a Ur. i ver's : t y , Jdpari.. mar s aria @ ics . oo . c-Scika - 

aC. jp 

t'ACIFH: SVr^rAOIUM (7N 5 :(:)C0MP'JTING;, (1997) 2o0-ii. 
.T'jar nal o< nj-r : l'WC' . 
.;■ i n':;ar:": : o 

Journal; Arriol^-; AA:i''r.M/.^p AP.TICAE: 
Enqlion 

I'L i-Z'Z i*:. \- A. V : ^ na 1 




TrA a ;:ar'On p-.: osa-r.t an ci}:ajrL^acri detect icti of p^rotein struotoral mot its 
In our cippro-aoA, fii':it all pvrotein h-ackbone cc^nf '"_'rmao ions are converted 
\r.to chai act er strir.QS aaina an encc'dino s^-herrie. Theo'i we use tne 
.i'lni t h-Wat e rman Ici^al a 1 .1 qnraono algorithm to detec:t ccimnon structural 
irujtifs. By c-: mp-a.i i no; results with the Pr.OSITE reoulai expression p)atterns 
i:-ur methc-(.i -van detect sever. ^ I iroAifs v;hlch the FFAkBITE patterns tail to 
oet ect . 

Ahe-:-K T.-iOS : .ojp.p.rrt, K-.n-jA?. (Gov't 
Ajidnt Acsi'-j ;?eaoo-no^ 

Calcium-Binding Proteins: CH , chemistry 
Computer Simulation 
Crystallins: CH, chemistry 
Databases 

Databases , Factual 

Evolution, Molecular 

Information Storage and Retrieval 

Molecu±ai Sequence L)ara 
* Fr'jteiri ('Aonf orEiat i'tri 

F r o t e i n A t r u c t u r e , A e ■: - jnda r y 
♦Proteins: CH , chemistry 
♦Proteins: GE , genetics 

Secjuence Alignment 

Sequence Homology, Amino Acid 

Software 

0 (Calcium-Binding Proteins); 0 (Crystallins); C (Proteins) 
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EvcAutionary cc^nser vat i on and p^redicted sti-ucture of the L'rosop'hi la extra 
sex C'tmrv- rec«ress'tr fjr;tein. 
: I J ; L L P. ; 1^1 o r g a n t ; :7 i ni t n A 

Jeparrment -jf Bi ocheriii s tr y , Cniversity of Minnes-^ta, St. Paul GSlOB, USA. 
CH4 9cP0 (i:iCM3) 

MOLECALAF. AND CELLULAR Bl'jLijiGY, (1997 Nov) 17 (U) 6663-70. 
A.vjrnal coae : NAY. lASN: 0270-7 306. 
Nr.i^od At^*:es 

1-jurnal; Article; (JiaUr.NAP AF.TICLE) 
En-::lisn 

Pi iority v."^ j urna I s 

AEN3A^]:^-AFiAJ3 60 3; GENBANK-AFO 0 3 60 4 ; GENBANP:-AFOO 3 6 3 5 
: 998 01 

The I'ros'i'P'hiia extra sex ::omi:'S (ess) protein, a member of the Polycomib 
grcMjC' (Pr:^), i a transcriptional repressor of ntmeotic genes. Genetic 
stu:iies nave snowri that esc pr-jtein is reguired in early emLbryas at about 
trie t im.e thiat tt.ner Pc'3 prot.eins oeoome engaged in hcmeotic gene 
repression. The esc protein c^ansists primarily of mjultiple copies of the 
VJD repeat, a m^ttif that has t-een implicated m pr d t e in-protein 
interaction. Ta further investigate the domain organization of esc 



c^r'-'tein, v;e ^lave isr>lai"ed ';:.:ar._i f^r i ::i r-o'.:: hu::.c^^c.qs ircni diverqent 

iri;-e':;t 3p^-cie:?. We report that ^'^s-: prorem i.j riigrily ''onserv^'-ci in h'juse^tl. 

bu r I. e r i. .1, y (0 ■■ i ci^j n t i c : a i ) , a n c:i 
vj t hi a t: * : h b u 1 1 r f i ^' hi o rn c i. o q p i' o v i ci e s 



cr.issh-iopper ^ 



i<:;er.t i ca 1 ) . We she 
eS'- furi':-t .Li'>n ::; Dr'j;: c^phi la , indicatlnq thiat the sequence similarities 
reiiec" f ai\Gt, : 'i.ina 1 e'^r.servat ion . hlornc-loqy laodelinq usin'..;i the orystui 
::■ I j':;t a.r e of >-j.::jZh-L WO r^^rjeat p-rotein, the ■"^-protein h^e t .i - sabuni t , 
|-a-^dic:s tnat '.'SO r-rc-tein .^oiop-ts a oeta-prop^eller- stractui^e. Thie sequence 
C'T'iix^ar i SC'M.- and mo^ieling saq^jest that tnere are seven W!:> repeats in esc 



pr-.^tein wrnicn 



■ qe^nei forT. 



2^'-.-\'en-t} dded beta-ijrc 



locate tne 



: t L r ec e-i i'.r:t ■ 



is: ^ pei.er 
r.^nctic-n lc 
1 rq. er.'i :-t i i"^*n v; 1 1 h 

Cr L- T,^ 



u.r : .-J : e , i oent l :: . e^. 



€sc t.):o*:ein with respect to thiis cied:cted structure 
e-:is ^■■f sp*"ud f i>. bosq. nac:;c i ?t to ^^:r^■■nd frc;r:[ thi^j 
c^aiservec pa:rts of esc prcte;:n required ior 
th.:at: thiese reaior.s rraqht rr.edicite physis:al 



a d ■■ 



c pa rtn*;r r rc^t::e ns . 
II. <a L ; tu]: per ' , U . , Gov ' ' 



, H . 



^uppoit, U.S 
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rutterf li.^-i-s : dE, qenetics 

I ': C'ninqq [-'c^e'/ula i' 

Computer Simulation 

b', nse rveii Sequent:e 

b i"s. 3 n i b.n : bM, enii:-r yc.loay 
* bi-c^S':'[:nilu : i.ib, q--net i cs 

: MA, t ^-iT.p: ein^-nr a ry : , q^ ::■ rt i 
*Evolution, Molecular 

( :r assh'i'P'p^r s : ''iS, qe net ice 

iiv/asef I ies : db, (^enet ics 

Insect Proteins: CH, chemistry 
♦Insect Proteins: QE , genetics 

Insect Proteins: ME, metabolism 

lb: eels, :b-le.:.lor 

Ibidecular Seq^ience Da- a 

Mat aqenes i s , b ite-[ii rect i 

Fr'jtein bi.ndmq 

h rctein tT'jnf '■■r.iiat i-^n 
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Repressor Proteins: CH, chemistry 
♦Repressor Proteins: GE , genetics 

Repressor Proteins: ME, metabolism 

Sequence Analysis ; DNA 

Sequence Homology , Amino Acid 

bt ruct ure-Act ivity be I at i^'^nshi}:' 
] j4 H-C 1 -G {F'olycomb protein, insect) 

e (bNA, C:.-:Tipl-meritary) ; 0 (Insect PrC'teins;.; ( {r^epress-jr broteins) 

AtiGWEB. 17 '.iB' G?> MEDblME 
^7.1 ?.5tl t MEbblME 
'7 4 ?5 SIS 

{■oraec^boxes in sea anemones [ -^ni iar ia : Anthc-zoa ) : a PCR-t-asec; survey of 
Mem d t C'S te ^ 1 a ve::tensis ari'i Metridium seni.b"'. 
bmnerty b; M.^rtindale M Q 

bepartnient. -ot Orqanismal Bio^<jgy and Anatony, University or Ghicago, 
Illinois *:.q6:b"/, i.lSA. 
HbM7 1 36 (dIGHb) 

Bb")LG:^dGAb BdbLbTIN, (1997 Aug) 193 f.l) 6G-t(5. 
■ Gjurnal ciJe: Aid. IGSN: 0 1 1] r,- 3 1 o 5 . 
■Jnited St.:ites 

TS)urnal; Arti--:le; :;jGUbNAL Abl^IGbE) 
En 7lish 

Pric>ritv iourr.als 



GE!IBAnp;-U4G72G; aENBANK-d4 h 7 .;:7 

dEtIBA^b-^^b:I'3^^• bENBAri?*-U4i7 M 

dE;jBAnF:-lMi7 34 ; dENBAlI?'-U4 1 7 35i 

dE:iBAnK-M0j87 j ; dENEANK-S 1 S 5 4 8 

dE:JBAl]p:-D3704i ; 3ENBANb-b'bj6 r^O 



dENBANK-U4h7 2H ; 
d£bBANr'-U4 ^ / ; 
dENBANF:-U4 J73^-); 



dENBANb-U4 S72 9; 
3EHBANb-n4i7 33; 
dENEAN?'-U4t73 



d E 1 J 3 A N b - M b 1 8 7 ::. ; d E b E Ab b - D 4 4 6 2 9 ; 
dENBANb-E3';042; dEbBAbK-A3 3 5 1 1 ; 



GENBAriK-S: 3 ; 2;ENBAN?:-S7 22 2 :3 ; GEI:BANK-3 2G7 '7 2 ; GENBAI ]?'-Afj 0 0 ^-^2 ; 
2;E!JBA:JK->2: 21 H; 2EIjBAN?:-S2^ 2-:.; t 
1 -v^^vi.: 
1 ■r:ni201 

H-.'nie'jl'.)L'X qerie;? b^'-lonq to a p)hy 1 joenet ica 1 i y v;ido^-^pr^--ad farnily of 

T'-'f J . at '.)r-y ';ienes tiriac play i.m}jo rt ant roles in }:'3*;tern iorniati.on and 

■:;vl.L-2ate spe :: i f i c.j L iori iri several rriDdel systems (e.a., Droscpnila, mouse 

and ''A ^^d^-aansi. Altn-iugh the evc'luti-jn nf rnariy -"lasses jt homeobo:-. 'jssnes 

j:a.edjtes the :! Lvers i f 1 L"at i on ot the Bi. Later! a, :s:r[ipa r^c. : i vei y ii. 2^1e 

kn-jwn aijoat hirrieC'bS'X aenes in oatgroaps tC' the 3 i later! a, siic2i as the 

'"r. i r i a . l\tr j e :i t h t.-o 1 yirirr ra.s-- < :ha in r s;,:o:v: :. ":n z o re r 12 partial 

h-K.e' ;..)':'Xe.; hrsm .ipe-^ies -jf .^e;: .dneiT^jnes , Me'., i i'li u::i ssnile arid 

2- Tfia * ■.■s*: e.: 1 a vtrs*:.c:'.s is ( }slo/.l ...1:1 7a.idarA.a; svlciss Ar:thozoa). These horrieoisoxe 

ar^r'*-rjL- to represer:t ^ dis'oin-::^, iriataal. Ly p^ar a lo jS'US noire S'b ox genes, 2 or 

va.ion t^t:-.! zr. p-re viS'asl y identitred oni(:iarian nomeobS'X classes, and 1 o 

wn.n.:n ap£:'<:^:ir retJiesen- prevrS'Usly anident 1 liec:; classes. The 

fO"/'^!s■it i s-nary r ^i-l at ic nship'S between tne h-Direcidoiu s i ns of sea aneincines and o 

L-^la*- err an animals were assesse*:: thrsiugh oataba^e sear:Ti'E-s and 

f;r.y 2'-o;enet 1 c analyses. As many as 5 of the anemone homesiritaxes may t'elc^nn 

to trie Wox class, which suggests that the Hc-x gene csmrlement of 

i:-T:idarians is larger than prevlS'Usly expe'Sted. H-smologs ot the 

e'/en- s kr f-p O'C ^t^ns- ^d lAro - <jr'hil a wore also identrfied in t'tS'^h Metriaium cin 

lA-iua^. ''.-oo-. el 1 Si . 

fln^rCH: Tags: Animal; Suppsrt, t . y . G-jv ' t , Xon-P.H.G.; hapr^ort , U.S. Gov't, 



AmrriC' Acid Segu-nce 

' 'oens^r nal: ^: 1 s : GE, gene''oLs:s 
G 1 '.-ning, il: i eoo. lar 

Evolution, Molecular 
*'2erie3, Homes)r"jx 

Homeodomain Proteins: BI, biosynthesis 
*Homeo domain Proteins: CH, chemistry 

Homeodomain Proteins: GE , genetics 

Information Systems 

rUce 

Mji'.-.-'uiar Geguence Data 
^■'hyl. ogeny 

i'olymerase Chain Reaction: MT, methsids 
Recombinant Proteins: BI , biosynthesis 
Recombinant Proteins: CH, chemistry 

Gea A.nemC'nes: C2, 'Sla s s i f iisa": is^n 
A e -1 An emis n e 3 : G- E , g e net i c s 
Sequence Alignment 
Sequence Homology, Amino Acid 

Ge-queris:e :lsmsd.':'-;;y , MU'Sleic A^-.ia 
(Hjirietid^'miin Broteins); 0 ( Becombinant Proteins) 

AIIGWEB 18 t)E 2:; ME[iLIME 
;^7.:3 2 2 GO;^ MED2IME 
:^ '4 2 2 602 

'Ine .numan oeta-def ens in-l ind alfiha-def ensins are encoded by adjacent 
:i-n-s: tw-j r.iepti'de f and. lies v/i-^h differing disulfide tapjlogy snare a 
■S''>m]Tisin an :.e s try. 

Liu L; Zhas) G; Heng H H; G.anz T 

D-pirtment :.'f Medisine and Will Bogers Institute for Balm.onary Research, 
'_Bii ve rs i.t y ot Galiforn:.a -it BsiS Angeles School of Mearcine, 20095, GGA. 
Hh-'"G.';"i 14 ' I'JHLBI ) 
H2-V.64 0 G'JdLBI) 
H2-4f^809 ■MHLBI) 

lENGMICS, (1997 Aug 1) 42 (3) 316-2 0. 
Jjurnal code: GEN. ISSN: 0863-7543. 
United States 



['1 .^-nrnal; Ai^i^ie; r^l-.V^C^iL; 
LA E::<Uish 

Priority J'jurnals 
C'S (::[-;:iBAN?:-U5G'^?0; (:-ENBAKK-n509 31 ; GEIJBANK-X927 4 4 
EM 1 ^-^rOI 

AE. W^r ciL'ned a ri^jve 1 humari bet a-de f en 3 i n ''jene and rlet; e::ni i.no(i i^s full-lengthi 
'_-l'!IA ^:'quenr;e . Tr.e eri'ir*- gerie sriianned more than 7 khj and irirduded a large 
^y''*''l-i'p inzror,, 'I'r.e l~\:^p ::L-MA. enec'detd <i pr eprcjpept" ide that cc'rrosponded 

^-eii: e i y rc; ::"ie recen^.ly identifieo nurtan beta-def ensin llBD-l^ an 
oM ■ imi rc'l: i a 1 T^ep't .ld'=- ;mplieated in tne r-eaistance ot ef;ithelial surfaces 
Z'.- ozdal. v-olorA lori . 3y tw^i'-O': lor fMuC'rescence in situ hyljr idi za t ior. 

(!e,FBl in H'JL^t. / (^ri^ nc^nenr- la: ij re ; :na}:ped *: o chr cite sor-a 1 region c |:-2 3 . 1 -p2 , >: 

in i-lc;]e loiC'Xiirdty (within iL-J- I : J <o) t'j the gone for the hoioan 

neat r.;-!:. nil a Ir: na-. f.:^nsi r; HMi:;-l (IjEEAIi. Thus, despit- a ccjrr;pie-e la^-k of 

seqa-rnoe simi 1 ^ .rti t y and tjospite di : f <:'rerces in their disui fi de-pa iring 
p,-3r:e.rn, tne .-ilpha- au: Bet a- : airii lies <ippe<ur to have evoiveo from a common 
p r ernamnia 1 1 a n i:ie f r-- n s i n < ;e Tj e , 
Ct Cneok Tags: Human; 3':pxn:iz, i i . B . i3ov ' t , P.H.S. 

Algorithms 

Ainitii;.' A:' id ct.;:ouero::e 
^ Ar.1: i - I nf ect i ve Ag*;-n"s: MB, i[..et ah'O L i :ini 



♦Blood Proteins: GE , genetics 

'■.'n r-ciij: o : ite N.^r^p i no 

ti.r s-iriC'^: ■iir,es , Huir'ar:, Pa ir 3: GB, gen^ 

Databases , Factual 

: ..:ulf io^:S: BB, .:h^E-ia..e.: / 

BilA, Ci rT.p 1 tMii^rnta.ry : All, .::nalysii 

1 BA, Bur}: 1 em'-nta r y 

MIA, B'titp 1 H-m'-rtai y : 

Evolution, Molecular 
''^^^'nes, Btructural: GE, generics 

Iri Bitu Hybr idizat Il'Ti, Flaoi e:: oenoe 

li^do'iular Beguen'::t' E'ata 
■^Muitigene Fairdiy 

Peptides: CH , chemistry 

Peptides: GE , genetics 

Pr^i.teir: Btruoture, Beocndary 

Sequence Alignment: MT , methods 

Sequence Homology, Amino Acid 
CU 0 ;.:lef ensins) ; 0 ( Ant i - I n f ect ive Agents]; 0 (Bloc^d Proteins); 0 
( L'l sul f ides } ; 0 ( BBA, Ccmpleruerta ry ) ; 0 (Peptides) 

B:57 AUBWEP. 19 t'B 0 3 HEDBIIIE 
AB ■^^72 3 : 1 00 MEC'BIBE 
BB 972 331 33 

T7 f)}-'*: iiuurn s i;p'rti i ni]:' js 1 1 i iTi of pij^;ein structures: anibiquities and 

ir:'.r' 1 i :'a" i 'tris . 
AM F^U''^ 7> K; Siprd M B 

f3Bi B-nter for Ajuplied Mcdecular- Engineering, Bniverslty of Salzburg, 
Aust ra 11 a . 

BB F''hBTBB ABB BEBIBN, (1996) 1 (2) 12 3-32. 

:r-.-;iTiul ooae: CBB. IBBU: 1 3 ^ 'BZT 5 . 
'33 EB^B.ABfi: Bnited rungd'jn 
BT B.:rn^]; Artd-le; ( J :)'JRMAB ABB B3BE ; 
BA Er:glLsn 
FB Priority Jturnils 
EM l':^'.-7B7 
EW 1 .^ 7 3 C) 1 

AB EA'^BBBBBl IB : Teshrd. qa-s f':»r ■t.'Smp ai' is sn an:l optirrum super impos 1 1 ion of 
pr jtein structures are in riisfiensat'le tools, prtviding thie basis for 
s* stistioal analysis, m-oielin^, prediction and classification of protein 
fjlds. Observed similarity st structures is frequently interpreted as an 
inii::ation of evolutionary relatedness. A variety of advan::ea techniques 
are available, t)Ut so far thie important issue Df ur;igueness of structural 



upcr irnpo^ i t iC'ii Yias h^^n idigeiy neglected. We -r^et :n.t tC' irivestigate this 
issue h''/ I mpleraent ing an e f f ici^^rit cilg(:'rithim f :^ r str'Lctu.re s'lperimposi 1 1 cit. 
er..iL»iin' : loutine s em ones tor alternative i. grjuent s . RESULTS : The 
■al ' ('jrit hm ijased '^n r'f'tinrjm super irupr.s it i on jt s r U'::t u.r es ^.ind liynarnic 
p I ■ ' J ■ a rr jn i r I '"4 . The i m c ■ 1 e it. e n t a t i o n Is t e s r e < 1 a n r i v j 1 i ( J t e d a i n g p u ti) 1 i s hied 
re-su.l.ts. I r: particular; an au t c-nia t i c e 1 .jus i f i. ctj t ic^n ot all pij::e.in tolds 
ill a recerit I'elease the pr^i'tein d.Hta barik i;£ oe rd rmed . Tne results 
i" a ine'U are rlU'Sely relate<:l t pub 11 she'd da^ri, :-?iari'r i s inu 1 y , tor m.-iny 
pi'jteln [ja i r s alternative :: i i^rinriient s are obtaunoii. IT.ese alignnierits -nre 
i n'.l : St i n ^a 1 s hib I e 1 r. teinis ot nuriiber oi equLVcil--nt r'^^'sldues and ro'jt mean 
S'qaa.r'e ot sapei inip cisit ic-n, h'Ut tne respe^i'tive sets f equivalent 

r--:due ;t.:i i .r s r •? : :r,p 1 t e 1 y ai:ti:M_:t. A 1 1*;; r r^.i t i vg iiignir^nnts .liar'e sljservea 
ler all ]:^l■:■tein a rehl t e'l't j r^es , inclu'::i.ng mixed a 1 pi . / oe t u fcius. 
uilddS'JS I 'j:iS : SuC'e r iiTirosi t len sd C'ristein folds 1? f r-rq sent 1 ■/ .:;:nrd gu'tas , 
Idiis has several irrpd i ca t itins o:\ thfs int e.rpre t s : 1 (sn f t r uct r a 1 
similarity with .resrect tj evsd u.^ 1 ana r y rela^ *';one'S and 1" r-rtr.icts the 
ir-:.'^^ c: ,ntp 1 i cahd 1 1 1 y cf .: :^{:;"E i in p i.j;:;*rd structur-'S .:s\ statistical an.ilysii. 
In tiart icular , studies k^as'^i:: ori the im}t licit assunact ic>n that tptimuin 
s ..T'ei'lmt'Cs i t i S'n ot st ru'it ires is unique are bC'^ntd t :■ ise ir- i s ] eacii ng . 
CT i^'h-ck Tags: Suppu-rt, N<in-';.S. Gs^v ' t 
* Algorithms 

M'Ai n C' A c i d ;? e q u e n < t s- 

Fvaiuatitn Studies 

Evolution, Molecular 

Mauler s , il'id s- lu i a i' 

M'Ciecular- Sequen^s Sat a 

M-decuSaj Structur-r 

S: ijt. e i n F^dciing 

r'r'Cteir. c'tructur e, Seiic nCva r ;y 
*Proteins: CH, chemistry 

Proteins: GE , genetics 

Sequence Alignment 

Sequence Homology, Ammo Acid 
(:N 0 (Proteins) 

LI 57 AIISWER .S; OF 2? MSjLIIJE 
AN ;'V:.S4 61n MEliLIN:! 

:i;! -^'/ss^ 6s(^ 

I'i Ev'j lut i .sma ry ni'Stif and its k-islugical and st ru st i.r.:: 1 significance. 

AU Tateno Y; I fieo K; Imanishi T; Watanab^e H; Snd-s T; 'faniaguchi Y; Suzudi Y; 

Takahiashi Tsuri'Syama Pw* Ka\va 1 M; PSavvanishi "i; Na^tciu K; tioji'k'cri T 
CS S^jnter rc-r Infc-rmaticn Ei'SlLgy, Karicnal Institute :f Genetics, F^ishima, 

.Ja{:ian. . yt at ens- 13 genes . nig . ac . jp' 
SO JOflRMAL OF MOLECULAR EVOLUTION, (1997) 44 Suppl 1 S:.&-43. 

J'-.urnal code: J76. IS3!I: L.::SS-lr44. 
CY united States 

DT Journal; Article; ( JtiUP.NAL AF.TIOLE) 

LA Eng^^isn 

FS priority Tu-urnals 

EM 19■,"^7(:ltS 

EW 1 '^^ '.^ 7 0 S 

AB W- devel'Sf-5d a metht'-::! f'tr multic'ie alignment ':>f" p^rL^r^dn se:5uen:'es. The 
:T..:iin teiture cd this ru-'thc:! is that it takes ^: ne ev-: 1 ut i i^n ar y 
re lat icnsnip'S sf tne pr.steins in guestis-n int'i acoj^nt repeatedly fd)r 
►T'X'.^su: i un, utitii tne re 1 a t iu^nsh i p s and alignm-'nt results are iti agreement. 
v;e then ar-plied this meth-s:! t-s the data of tne int t-rn at i jn al [iNA sequence 
■:ia taoases , whisdn ar'e the niDSt c turpi r e he r.sive ani updated [)NA databases in 
the v/.orld, m order to estimate tne "evolutionary nistif" r-y extensive use 
■: 1 a suriercs'mputer . Though a fev; riroblemLS nee^iei t-o be s-:d ved, we sould 
-■s*:im.ite the lengtn of the m-ttifs in tne range of Sj to 2j0 and n^j acids, 
with aojut 6 J the msst frequent length. We then dis sussed their bislogical 
ani strustural s igrii f ic an-s.e . We believe that v;e are now in a t'Cisi+'ion to 
ana^yz.e SNA and protein nc>t c^nly in viV'S and in vitro but also in silico. 

CT ^:heck Tags: Animal; Human 
Arriins" Asid Sequence 
Amino Acids : GE , genetics 



Databases , Factual 

r)NA: GE, genetics 
*Evolution, Molecular 
*Proteins: GE , genetics 

Sequence Alignment 
♦Sequence Homology, Amino Acid 
RN 'h 07-4 -^-2 [ DNA) 
^''N ■■) (Amir:o A::ids); 0 (Prrteins) 

LI 57 AHSWER ...1 ■:)F 2i MEL-LIME 
AN It' 3 5-^1 [-lEC'LINE 

i-ii : 1 ^; : 

71 wi.ii'L-- arid i;ier.: an ev'j I ut i-'nary perspective n thf^ 1" ibrc^tj] -jst growth 

: -n:;t<.i 'EeJE; and FGF rereptiji' familie,:. 
AU E; Rc^n^ a rC'-^ t i E; rj:>ar'in P.; Ha.rtung H; c-oidfarL M; Eirnh^aum D 

r3 L-ibr.ra^:c ir-i- d ' <: r:cc:d'-g i-:- Mr 1 e :ul ^1 re , L.113 INEERM, 2 7 Ed. Lei P.oure, 13009 

M^rcej:ie, Erance. 
EC) .'f.iURIJAL OF KE'LEirULA? EVt)LnTTi7N, (1997 Jan) 44 (1) 43-^6. 

'■.■uiriai ■::':e:e : .;'7E. I33>^: Oji 2 -.E3 4 4. 
EV ''riitea Etates 

i2T .T''Urnal; Article; (JOfJEMAL ARTICLE) 

LA Encl i an 
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AB I-E^Es t t ibi 'I'td aat gre^wtn fact-jra) play major roles in a numtier e-f 

i'. vel '^fjinen: a i prccesaea. r.ecent ituditrs ■ j t cev-^rai h.irTian ciisoraers, anri 

■ n .:ar I'ont analyaia ■: f cere --n^}::<-oa7 and pic^o-E r t lea ed Lne 'JO rresponding 
r-'"-':*'jr'':b : nan" or<:-tei.na h^ve an^^wn that. EGE.: an'O ^lieii r: <z:cepLC'X- a.re 
L'lv.rai n^-nt 1 y iriv.idve'j in rne ':le\^e I orjinen t -jf: tne okeLe^al sy?t-'m in irLannjr.ais . 
W'/ ■::e'::.i: a r t:he 3-:-:iue nc-:'S ot tne nine enown manurL-iil i an ECIE.e , E'GEs Eroiu 

.■*aner ve rt --t: rat ea , and three additi'tnal aeqaen:es th.^ir we extracted from 
-n.isti.na d.jt abacas: hunian E3E ce^qaentea tnat we tencarively designated 

l'''7Eli:' and EGEll, ana an EGF sequence from Caencrhabd 1 1 i s elegans. 
Eiiailarly, we have c^mr-ared the secjcences ot che fe-ur EGE reoepitiji 
paialC'^s f'-'und in cncrdates 'witn fc'cr non-cn«jrcate EGE recerit (-> ^s . 
including 'tne recently identified in C. elegans. The comparison ot EOF and 
Ei'iE recept'jr seo/ienres in ver-tebrates and nijnverteh^rat es shciws thac the 
E'';e and EGE recep'tcr families have evcdved tnrcagh pnases of gene 
■riapd 1 ':at iC'iiS , <tne ot wnich miay have ccdn:*ided with tne emergence ot 
veitebrates, m relation with their new system of body scaffold. 
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AB In t o"^. r ^p'l'd-f , Hox gener are *:'S3'E-nt ial fc-r tht^ P'tupei ciganization and 
d'..-'/e Icr -n-'nt f axl.h.l st ract a : . Experinient s involving Hox gen-B 
i.ri.-.icni va^: i 'Ons have revealed cheir pa rt i ::"jlar 1 y importar.t functicins in the 
e.- *;. abl 1 .-fr.n.e:'. >: 1 iU'ji:.h': lo'-^i iil : r -^ni^ i 1 1 'i^ns withirL in^tiAii.e r ic series sucn as 
^r.e -i'h'ra i ooi.uinn. 'leieC'-^J fish shov^ a much simpler range of axiai. 

ftran^: -j.r ar i^end i.ca Rs r ) mo rpiRcd c-gi es , vjnioh preempt ^-d us to investigate tne 
n:i*OJlr^" f tn-- H'j>: .;.y.rteni i:: these Lower ve rt et;) rat es . Rf^-re, v;e snC'W that 
fish :....i-7e a 1 ana l.y c^f r:t-x g-o".»E-s, very ?0.milar- in bcJh. rjjinber ^ind general 
L'iofar; i :-at i 'jn , t that t et laco'ds . Expression studies, carried out with 
Ht'XL ar.d Ht'XA genes, sh.O'W*E'd that all vertebrates use the same general 

-loE'Kie , mv'tlving the <:;C' 1 i n^-ai' <::Ct i vat i t'n cd: gene expre-ssion in bcth space 
an'i tine. Ctiup^ar isc-ns Letwe^rn tetrapods ana fish aUow-d us to prcpose a 
rr.'j'i.iei which acccunts fc^r th^j p^rimary fum^tion c-f this gene family. In this 
n.'.-'iel, 3 few ancestral H'jx '.i^rnes were involved in the oetermi nar i c n of 
}:"-Ja;.ity i ri t Re tii ltc-s t i ve tra-::^ and were further rc^cruited in mC're 
elulojiate axial e;^ ri.:ct r e,;-: . 
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teins are C'ften derived '.v:Lthoat their 
act\^re of -z^ luil ::oiQp .1. ex is aVailarde, tne 
'^naily ana ene r q-r : i ■::a 1 iy significant i::ay n*"' 
'rxiezric interfacie d-:dace:.i fr^iTr, the spatial 
, even af^er <:i ^v roc tare is soivec, it 
^'/rrijin.:: a majc'r : hec^r ^-t : ':a i ..ind exper iiTient ai ac-il to local Iz-^ r-rc^tein 
: s.n or icaial intei faces and vjicerstand the role of their cc-nstituent 
neso'iaes. The e\'': let i canary tra:e methcd is a systematic, r r ar;sf:'ai ent and 
riovol ii^redi 'Tt i ve technique tiiat identifies active sites and functic'nal 
interfaces in p'rc-t^E-ins witi'i knt'wn structure. It is iaased on the extraction 
I J f functionally imp'ortant i*;;-sidues fr':m secuen'::e 'Conservation piatterns in 
:".':.ir.'.:- i':--UL'US ]::'rijf: ei ns , and o:\ ':h'::ii- mapir'inu c^nto zhe p)rc'te in surface to 
ue:-:'-rate :-l...oters icentifyinu func*:ionai inter f:ices. The S::2 ar.d SH3 
m'--aular signal irio <':i'jmains onsi r he DNA binding d.'::!main of the nuclear 
r.irm-ino rscepc:r-s r^rcviLie "f:ests for t.ne az::ura::y and validity c^f o-ur 
ir.et)-.od. In each case, the v< li^t ionary trace deiineates the functional 
-pd*;'.C'e and i d'3-rit i f i es resid\:es. i::riti:al t hi:. 'ling sp^s-c i f i si t y . B.i;sed on 
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iriat-r'tmsd e E-ules , as well as -tudies o: spec i f i <:- i ^: y in ]!i':d -cul ar 
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AB G 1 y^'L s.i.'la , which 'rl-'ive tne gl ycosiclic bond botv;oen a r:-a rbohydr a t o and 
rif.^:-^ r.or rnciet'/, have ii-E-on cLj33i.fied into o'/or 6^ fsinilios. Here, a 
vari':0:y ■.rcahj:' at a 1 1 r.a 1 t-':::":n i l^u-: o have l";^;--:! erri}.d^_^ y^_o:i CC' examine :hree 
h5::r,i 1 i e.:- Liup':a:o ant ir. :'nr;.::i .i^rjj .::L;nornial piithDlrgy witn the -in et 
::;eV'Ol :'p.ino a fiamewor e t jr f\i"ore r; :_::![3lcoy iT:odeiinj, expor ii^ent a I and 
L.o.n-.r ^*:aoiei.. EaiuLiy i i n-:.:: b.i«:^^'a t^a "'t er i a . :ind aionseal ►E'nzyn;'='S as well a^ 

d L a..:ir-o:"ia r i de i n^ 'j leran re L. 



d .3':Tin'5 re5p^:o:t ivei y . A statistdoal moiael, 



hid'.ien Ma.rk<jv m'loiel (HKI4), fcvr the farnily 1 qlyrosidase dtmain vr^-z traine 
ana a.?e<i a^; the hasi^ f-ir ■I'^'itX',^ ra" i ve examination trie O'jnae.rve'il and 
Variable sequence ana atruotaral fe.^iturea as we hi as trie phylogenetic 
re 1 at i<jnah ip'S t'etw^i-er: fair-.ily ir.einh'er;:: . Alth^^jgh the structures of f'jur 
tanaJy 1 a 1 yL^rs i oases nave lrjee:\ det errninerl, this i. ;= ^Yi"^ f i r.::i 'somj:) n rat i ve 
exaitdndt i <:n all th»:- se erzyrres . Aii.pects ^ n^ t. ,ire sin i que to opeoi fd c 
ins-rrih'^i; r s sr ssd: f anil i i es ( sut^s t :r at s l::indina L:;s.ps; as well tn-ss-s- C'jiianLS"! 
,;ill rr.eiTir'e rs (a b^eoa / a 1 n ^ ) h'a.rirel told) hao^o oeen dotinO'S. A'Stive site 
resiaues in sS'iT.e dsmains in o hjthiS and laot ase-ph bs ri z in h-y'drsdases diffe 
tisiii K.^-Kir-ej: s ana in c-ne instance may ;:)ind l^a: not sd.eas'e sar) £;t rat e . 

Th'i' fsar irs/siriant ar:d lasst l'.ianl\- ■sonae r'.'':-d resid:.es are iMjt lesidues 



na^' i" 



.1^ 



m r i ..^in^- 1 1 1 S'l". 



lilt ! 



o'lO 



a: e s t ab i i zar i.ori. . i^iuc: 



S ) 



c ma: 



1 ■j^:M".::mb:i e 



, , ..■,.^r.. i tarn: ^ 

.l.yS'.'S sma 1 ststraqe oissras-ra L^as^cnei' disea.se and Itr.jobe di.seas:e, 
t:"esf:.iecci V eiy . HMM-::-ased analysis, st ractuie prediction studies ansi 
ex iiTiiinat i n i:;if disease niuoati(:'ns reveal a <:ilyc Dsida se dc-mai n c:S'min:>n to 
these ow'S fandli-.-s that alsc' or:curs in sc^me tiasterial glycc^sidases . 
Similarities in tne ieactis>ns s:atalyzed t-y families SO and S9 are 
reflecoed in the presense cd" a strurcarally and functionally related 
(t'Ot a/alp'haj 8 t^arrel fsdd rebatea to that in family 1. 
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A rry.-rnorj, i-3 ti^r ':*;ier.t: ed z-:- ■^:^:^''^s.i ^ne s i ard f i cancr' bindina site 
s iara .Rj r.i t i es :vi^r.ir. :f ap er i s*:-d C'ra*.eia\ t hree-din.ens ic na i (30) struccures 
and aj:.-Ci.i i ^d ra >'A1 siirdla.r : r uct are.E i:\ the Prc>Rein Data R^nk. For- 
;: irra 1 a rd a i ^ r re^v;een :33 str^jrtares Ricking significant seqaencc 
s iir..i i a rd t y , the irr.p'Litant a i s t inct i'^n was made k-etv/een reiucte homc'lcvay (an 
oncierrt lui-a-n ,:;ncest ij.r ) an-:: ariaR^gy (iiaeiy r^envergence c-j a foLainq 
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nine sup^e r f Ids , the ac-::uracy c-f p'redicaions of binding site Ic'Caaicins is 
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AI: r^:. t i':'r.o gi-mg k^ack iri'-re tha;; 2j year^ .:riciw tridt r egic^ns in proteins 

vj.zh S'jr cle' r ed Ij-^c kL-jr.es '::an p-Lay ro.'e^ in tneir t-indinq to other 
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